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Preface 


We are delighted to invite you to participate in 2010 International conference on 
Electrical and Electronics Engineering (ICEEE 2010) in Wuhan, China, and 
December 4-5, 2010. The objective of ICEEE 2010 is to provide a forum for 
researchers, educators, engineers, and government officials involved in the general 
areas of Electrical and Electronics Engineering to disseminate their latest research 
results and exchange views on the future research directions of these fields. 

This year, 2010 International conference on Electrical and Electronics Engineering 
(ICEEE 2010) invites high-quality recent research results in the areas of Electrical 
and Electronics Engineering. 

The main goal of the Conference is to bring together scientists and engineers who 
work on Electrical and Electronics Engineering. The ICEEE conference will provide 
an opportunity for academic and industry professionals to discuss the latest issues and 
progress in the area of Electrical and Electronics Engineering. Furthermore, we expect 
that the conference and its publications will be a trigger for further related research 
and technology improvements in this important subject. 

ICEEE 2010 will also include presentations of contributed papers and state-of-the- 
art lectures by invited keynote speakers. The conference will bring together leading 
researchers, engineers and scientists in the domain of interest from around the world. 
We would like to thank the program chairs, organization staff, and the members of the 
program committees for their hard work. Special thanks go to Springer Publisher. 

We hope that ICEEE 2010 will be successful and enjoyable to all participants. We 
look forward to seeing all of you next year at the ICEEE 2011. 
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Link Building on Demand in Multi-robots 


Yi An Cui and En Ming Xiang 


School of Info-Physics Engineering, Central South University, Changsha 410083, China 
{csu-iag, enmingxiang}@mail.csu.edu.cn 


Abstract. To build effective communication link between two robots while 
physical link did not exist in multi-robots system, a strategy of active com- 
munication link building on demand was proposed. Based on some model 
assumptions and definitions, so called SSS strategy was presented to build 
communication link in multi-robot system. And the corresponding algorithms 
were discussed and compared in simulation. The simulation results indicated 
that communication link could be built successfully by employing these algo- 
rithms and the performances of algorithm could be analyzed quantitatively by 
using energy consumption and time consumption of robots in link building. 


Keywords: link building; multi-robots; simulation; algorithm comparing. 


1 Introduction 


Communications take a basic role in cooperation of robots with information exchanged 
[1]. At present, a great deal of research on how to keep connectivity of Ad hoc networks 
has been done in mobile robotics. But it is still a problem how to maintain correspon- 
dence reliably in robots while some key nodes do not work. Many methods of networks 
building was proposed only under optimal conditions without failure of robots. Howard 
etc. [2] presented a heuristic strategy to build networks, which is centralized and not 
robust. Pezeshkian etc. [3] let robots be formed up into a line by following a leader. The 
failure of robots was considered, but the connectivity of networks was not sure. Ulam 
etc. [4] discussed 4 different methods to rebuild networks under robot fault in mul- 
ti-robot system. These methods are limited illuminating but not applied. Vazquez etc. 
[5] and Anderson etc. [6] constructed and maintained networks based on some hypo- 
thesis, such as each robot has at least one neighbor and so on. 

But on some occasion, there is no physical link at all between two robots, any 
method and strategy mentioned as above is disabled and useless. So an active com- 
munication link building method on demand is presented for multi-robot system in 
unknown environments based on robot’s mobility. 


2 Problem Description 


In cooperative mission robot system, robot R, and R, can not move at will for task 
necessary. But some time they still have to exchange their message. For the reason of 
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limited radio range, they may not communicate with each other directly. In such cir- 
cumstances, some robots just lying between them act as relays to build communication 
link. And this is also an ordinary problem studied in Ad hoc networks. But if there are 
not such relays as shown in figure 1(a), any router discovery protocol would be disable 
because the link is not here at all. Then it is necessary to move some robots to desired 
position to act as relay between R, and R,. As shown in figure 1(b), 4 robots moved to 
corresponding place (showed by dotted line) and an active link was build. That is just 
the problem we will discuss in this paper, so called how to build link between any two 
robots on demand in multi-robot system. 


(a) R, and Re, can not communicate with each other (b) R,and R,communicate with each other by relays 


Fig. 1. Active linking in multi-robot system 


3 Model and Definition 


3.1 Basic Assumptions 


For ease of problem presentation, some assumptions about environment and robot were 
made as follows. 


Assumption 1: The environment is an ideal two-dimensional plane without any 
obstacle. 

Assumption | extricates us from path planning and obstacle avoiding, that is re- 
searched in many literatures about mobile robot. 

Assumption 2: All robots are dots and their size is neglected. 

Assumption 3: The communication distance of robot is limited to C,. 

Assumption 4: All robots trend group by instinct. The robot without mission 
would wander in the environment and try to stay nearby other robots. 

Assumption 5: The robot has distance and angle sensors as theodolite. 


3.2 Connectable Tree and Connectable Frontier 


Definition 1: Taking robot R; as a root node, a tree graph could be constructed by 
robots that can communicate with R; directly or indirectly as shown in figure 2. This 
tree is called connectable tree of R; , denoted as 7;(R;). 
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Definition 2: All leaf nodes in connectable tree are called connectable frontier, and the 
leaf node with maximum number of ranks is called the most connectable frontier, 
denoted as Fr(R;).As shown in figure 2, R2,R4,R5 and R7 are connectable frontiers of R;, 
and R7 is the most connectable frontier. 


Connectable Tree 


Fig. 2. Connectable tree of R; 


3.3 Space Coordinates and Communication Distance 


Definition 3: p,(p,6) denotes the polar coordinate of robot R; in its environment. If a 
robot is right on the origin (0, 0), it will be called origin robot, denoted as Ro. 


Definition 4: d,(R;, R;) denotes the space distance between robot R; and R;. 


Definition 5: d.(R;) denotes the maximum communication distance of R;, and d,(Rj, Rj) 
denotes the maximum communication distance between robot R; and R; directly. 


Then we can get the equation d.(R, Rj) =min(d(R;), d.(R;)) . And considered of As- 
sumption 3, another equation can be got: 


dA(Ri, Rj) = d (Ri) = d(Ri)=C 
3.4 Communication Link 


Definition 6: Beeline link consists of many collinear communication nodes, and forms 
a maximum connectable communication link. Beeline link can be denoted as Link{R), 


Ry, ..., R;}. 
Head Ly Tail Le 
B)—-®)—-®) CR) 


Fig. 3. Beeline link 


4 Y.A. Cui and E.M. Xiang 
Definition 7: The two ends of beeline link are called link-end. One is called head-end 
then the other is tail-end. 


Definition 8: Link length means the minimum space distance between the two 
link-ends on beeline link. It is denoted as d). 


In figure 3, d= d,(R;, Ri). 


Definition 9: Link-hop, denoted as /;, equals the number of link nodes subtracted 
by 1. 


In figure 3, h,= i-1. 


Definition 10: Increase a new node near the tail-end in a link to be a new tail-end, then 
a new communication link with d)+ C; link length is formed. This process is defined as 
link-increasing, denoted as follows. 


Link{}=Link{}+1 


Definition 11: Beeline link revolves round it’s head-end by an angle @ in a plane while 
the other parameters are fixed. That is called link-rotating, denoted as follows. 


Link{}=Rev(Link{},9) 
4 Active Communication Link Building 


In unknown environments, a strategy called Spiral-Step-Scan (SSS for short) is pro- 
posed to build communication link among multi-robot system. 


Ge ~S 
a 
¢ a aiare Xs 
ca Pa s ‘ 

f t ‘ \ 

1 rR Ri oa Riv 

\ \ f v7] 

‘ , )  Link-increasing 


ba 
\ wA-7 : > 
N Link-rotatin; 
ie 


. 
~ “7 


~-—=—= 


Fig. 4. SSS strategy 


The process of SSS strategy includes let R, be a head-end, combine other robots 
discovered one by one to form a beeline link, this link run link-increasing and 
link-rotating by turns. Then the tail-end of link will step in a spiral until R, is discov- 
ered, and the communication link between R, and R, will be build at the same time. The 
detailed algorithm is shown as follows. 
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Alg. 1. SSS algorithm 


Step 1. R, searchs connectable tree, if R, is discovered, goto step 10; 
Step 2. R, looks for a free robot R; in connectable tree; 
Step 3. Form a beeline link Link{R,, Ri; -7}; 
Step 4. h>z, failure, exit; 
Step 5. Set link-rotating counter Rev_Counter=1; 
Step 6. Rev_Counter=6h,, goto Step 9; 
Step 7. Tail-end L, searchs connectable tree, if R, is discovered, goto step 10; 
Step 8. Link{ R,, Ri -*}=Rev(Link{ Ry, Ri, °},0); (O=17/Gh)) 
Rev_Counter= Rev_Counter+1 ; 
Goto Step 6; 
Step 9. Link{R,, Rj -°:}= Link{R,, Ri, +-:}+1; 
Goto Step 4; 
Step 10. The link between R, and R, is build successfully. 


Lacking environmental information, SSS strategy is a blind search algorithm and is 
inefficient. It may cost much energy and time to build link based on this strategy in mul- 
ti-robot system. If we take full advantage of the most connectable frontier, the number of 
link-increasing and link-rotating would be reduced, and the link building would be more 
efficient. Based on definition 2 and definition 6, there is a corollary as follows. 


Corollary 1: If the communication link between Ri and it’s most connectable frontier 
FR(Ri) is a beeline link, then: 


d, (Ri, Fr(Rj))=max(d,(R;, Rj)), Rj CT (Ri) 
In SSS algorithm, change step 2 and Step 3 as follows. 


Alg. 2. SSS algorithm based on the most connectable frontier 
Step 2. R, looks for the most connectable frontier robot Fr(R;) in connectable tree; 
Step 3. Let R; =F r(R,), form a beeline link Link{R,,...,R;,...}; 
Theoretically, this improved algorithm should work with better performance for link 
building. 
5 Simulation and Analyzing 


To verify the validity of these algorithms, some simulations were implemented in 
multi-robots virtual grid environmental system which was developed by ourselves. 2-D 
local environment was set to be 88x88 grids, and the visible part were 40x62 grids. 11 
dot robots were put in the above environment randomly. Showed as figure 5, the dis- 
tance between R, and R, is 40 grids, denoted as d,(Rs, Rg)=40, and set C=10. All 
robots are free except R, and Ry. 
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Fig. 5. Multi-robots virtual grid environment 


The process of simulation was shown as figure 6. In the figure, (a), (b) and (c) are the 
main states of simulation with SSS algorithm. The main states of simulation with im- 
proved algorithm are shown as (d), (e) and (f). 


Fig. 6. Process of simulation 


Link Building on Demand in Multi-robots 


Detailed comparison of process with two algorithms was shown in table 1. 


Table 1. Comparison of process 


SSS Algorithm 


Form a beeline link Link{R,R,} and 
link-rotating one revolution, shown as Fig.6 
(a). 

Link-increasing to be Link{R,R),R.} and 
link-rotating one revolution, shown as Fig.6 
(b). 

Link-increasing to be Link{R,,R),R2,R5} and 


Improved SSS Algorithm 


Look for the most connectable frontier and 
form a beeline link Link{R,,R3,R4}, shown as 
Fig.6 (d). 

Link{R,R3,R4} link-rotating one revolution, 
shown as Fig.6 (e). 


Link-increasing to be Link{R,R),R2,R,} and 
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find Ry, shown as Fig.6 (c). find Ry, shown as Fig.6 (f). 


As the simulation results, a communication link between R, and R, was build suc- 
cessfully via R;, R2 and Rz by using both two algorithms. But the improved SSS algo- 
rithm had more optimal performance than SSS algorithm from two aspects. They were 
energy consumption and time consumption that could be given by our multi-robots 
virtual grid environmental system in the form of detailed data. In the system, energy 
consumption (TEC for short) was estimated depend on the total path length of all robots 
in one mission.And time consumption (77C for short) was the number of clock tempos 
that was occupied by robots to complete a mission. In the above simulation, compared 
to SSS algorithm, the improved one reduced 28.8% of the energy consumption and 
37% of the time consumption. 


6 Conclusion 


A concept about active communication link building on demand was proposed for 
multi-robot system in unknown environments. In order to realize active communication 
link building on demand, a basic SSS strategy was presented. And the corresponding 
algorithm even improved algorithm was discussed. The simulation result proved the 
validity of these strategy and algorithms by building link successfully between two 
robots on demand. By employed two parameters TEC and TTC, the improved algorithm 
was proved more optimal quantitatively. The future work should focus on improving 
the algorithm further by using some self-learning strategy. 
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Abstract. Condensate pump is employed to pump the water in condenser to 
deaerator, maintains the water height in deaerator and enables units operate 
continuously. Currently, system efficiency of 600 MW units is comparatively 
low due to high design pressure head and large throttling pressure loss. This pa- 
per takes NLT500-570x4S condensate pump as example, and analyzes its exist- 
ing problems based on performances test. Further, aims at solve these problems, 
some modification works, such as reducing the throttling pressure loss and re- 
forming the through-flow as well as improving the craft of inspection, are per- 
formed. The results show that the highest efficiency of the condensate pump 
was improved with 5.5% under the best design flux. 


Keywords: fluid machinery; efficiency; energy conservation renovation; 
condensation pump; performance test. 


1 Introduction 


Power plants rated at 600 MW and above are the main development tendency in Chi- 
na. The ratio of power plants rated at 600 MW and above is 48% among all the new 
founded power generation capacity in year 2006. But from January to September in 
2007, the ratio is increasing to 57% [1]. With the emphasis on energy saving in elec- 
tric power generation industry and the urgent need for electric in China, more and 
more large-capacity and high efficiency units will be in operation to replace the low 
capacity, low efficiency and high-polluting units. To guarantee the reliable operation 
in large capacity units is also important [2-4]. 

Condensate pump is the main auxiliary equipment of the turbine generator system, 
the function is to pump the condensation water heated by low pressure heater in 
condenser water tank to the deaerator, and keep the deaerator water level stable. Cur- 
rently, condensate pumps of 600 MW units are using fixed-speed pumps. During 
operation, units load and openings of adjusting valve vary in a wide range. Especially 
in low load condition, the throttling loss is numerous. In addition, the design is more 
conservative, and more emphasis on safety, less consideration on energy conserva- 
tion, these result in the large the design capacity in condensate pump. 

Therefore, condensate pump has great energy-saving potential when it operating 
in the large flow rate tolerance and frequently changed work conditions [5-6]. 
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In this paper, energy saving retrofit on condensate pump is processed according 
on the actual filed condition. 


2 Existing Questions 


In our plant, each 600MW unit is equipped with two NLT500-570 x 4S type conden- 
sate pumps made by Shanghai KSB Ltd., the designed parameters are shown in 
Table 1. 


Table 1. Designed parameters of condensate pumps. 


Type Flux Head Rotate Speed Efficiency 
NLT500-570x4S 1427-1630m*/h 346-328m 1490rpm 83.5% 


Matched motor is 2000KW. Active power of a single condensate pump is at about 
1800KW when running at rated capacity. Double suction impeller is used in the first 
stage; the latter three stages adopt single suction impeller. Distribution of pressure 
lifting is in terms of stage. 

During operation, field test indicate that 250 meters head of delivery can meet the 
requirements, but the actual head of delivery of condensate pump is about 320 meters. 
Considering the stability operating of unit, 300 meters actual head of delivery is 
enough. The remaining head of delivery will dissipate in system. This is the main 
cause to retrofit condensate pump. 

Moreover, actual efficiency of condensate pumps is deviated from the designed ef- 
ficiency. The designed efficiency of condensate pump is 83.5%, but the field test 
efficiency is about 78%. 


3 Performances Testing and Analysis 


Performances and pipe resistance tests are carried out for the condensate pumps ac- 
cording to GB/T3216-2005. During test, parameters such as flux, head of delivery, 
electric power, units load should be steady. Fluctuation of values must be less than 
3%. Data for the performances test of condensate pump are shown in Table 2. 

Table 2 shows that the proportion of throttling pressure loss in adjusting valve of 
condensate pump is great. Among the five testing conditions, the head of delivery loss 
in adjusting valve AH accounts for the total head H of delivery at 38.22% to 72.86%. 
So the first step is to reduce the pressure loss in adjusting valve by replace the throt- 
tling adjusting valve with frequency conversion adjusting device. 

In addition, the efficiency of condensate pumps is low. The maximum efficiency is 
77.31%, and only 60.99% for low load at 298.1MW. There exist a large difference 
between actual performance and design value. 
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Table 2. Data for the performance test of condensing pump. 


Units Load (MW) 
Item Units», x\Ss 
298.1 418.7 478.5 538 594.2 
Pump Outlet Pressure MPa 3.596 3.4218 3.321 3.207 2.98 
Pump Inlet Pressure MPa -0.0805  -0.0813  -0.0824 -0.0827 -0.084 


Pressure Before Adjusting Valve MPa 3.56 3.3955 3.251 3.106 2.86 
Pressure After Adjusting Valve MPa 0.8 1.145 1.3 1.445 1.59 


Flux th 761.3 1023.3 1183.7. 1325.1 1456.2 


Kinetic Pressure Difference with 


Pump Inlet and Outlet m 0.137 0.247 0.326 0.416 0.488 


Head of Delivery m 379.06 361.43 351.64 340.23 317.06 
Electric Power kW 1357.3 1518.6 1604.8 1675.2 1739.1 
Shaft Power kW 1289.4 1442.7) 1532.4 1599.8 1660.8 


Proportion of Throttling Pressure 
Loss in Adjusting Valve 


Pump Efficiency % 60.99 69.86 74.01 76.79 = 77.31 


% 72.86 61.79 55.02 48.07 38.22 


In the performance of testing, we also find that condensate pump has casting de- 
fects. There is cellular point on inner cover, and the symmetry of flow passage is bad. 


4 Technical Retrofit Measures and Its Realized 


According to the results and analysis of performances testing, following technical 
retrofit measures have been approved. 

In order to reduce the throttling pressure drop of adjusting valve, the second stage 
Impeller is removal, and leaves the position empty. Thus no guide blade casing need 
to machine, the amount of retrofit work will be reduced. If the second stage Impeller 
need to re-install, any other technical measures are needed. But the guide blade casing 
still exists, which will increase the flow resistance, resulting in more than | % drop of 
the efficiency of the entire pump. 

Improve on the flow section of the pump. Key local molded line is improved by us- 
ing the model optimization testing and patented technology to enhance the efficiency 
of the pump and the ability of preventing cavitations. Detailed measures are: 

(1) Optimization of the blade profile of impeller inlet and outlet. The inlet blade 
profile of the first stage is designed as the fish head shape. Increase the entrance angle 
B, by 1.8° average and the opening port a, by 0.5mm. At the same time, the blade 
pitch error of the entrance is limited to less than 0.5% to ensure uniformity symmetry 
of entrance. Increase the angle of blade discharge B. by 1.2° and reduce the thickness 
of blade outlet from 8.8mm to 3.8mm. The blade pitch error of the outlet is also li- 
mited to less than 0.5% to ensure uniformity symmetry of flow section. 
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(2) Optimization of the flow channel molded line of axial impeller channel. Aug- 
ment the diameter of the throat Dp and the wheel Dg in the first stage impeller from 
280mm to @282.5mm and from @100mm to 99mm separately. The eccentricity of 
Do and Dg is eliminated. Outlet width of impeller is optimized from 73mm to 
76.5mm, and the local eccentricity of inner flow surface on both sides and the baffle 
in central is also eliminated. 

(3) Optimization of the whole flow channel of impeller. Improve of the glossiness 
and smoothness in all flow channels, including the transition zone of the adjacent 
parts, make the irregularity degree less than 0.1mm. This will not only improve the 
inflow and outflow of the impeller, significantly improve the pump efficiency and 
capacity for preventing cavitations, enable the high efficiency range of pumps shift to 
large flow rate, but also enable the flow in the impeller be more uniform and symme- 
try, enhance the stability of pump operation. 

For the other stage impeller, basic principles for optimization are similar to which 
of the first stage, but parameters are different. 

Improve maintenance and assembly process: 

(1) Do static equilibrium, high-speed dynamic balance testing and flaw detection 
inspection for each impeller; 

(2) In strict accordance with the manufacturer’s process requirements to assemble 

and to ensure a reasonable seal gap. 
After carrying out list retrofit technical measures, performances test of condensate 
pump is re-done. Referred methods are the same as before. When at the optimum 
design flow rate Q=1600m*/h, namely capacity is 1588t/h, the head of delivery of 
condensate pump H=263.5m, efficiency of condensate pump n=83.15%. From the 
existing performances test curves, the retrofit reaches the expected goal. 


5 Conclusions 


The designed head of delivery of 600 MW condensate pump is comparatively high 
which results in larger throttling loss and reduces the system performance. For 
NLT500-570x4S condensate pump in our plant, energy conservation retrofit is 
processed based on existing problem analysis and performances test. To solve these 
problems, optimization improvements, such as reduce the throttling pressure loss in 
adjusting valve, optimization of the flow section, and improve of maintenance and 
assembly process are performed. Field test indicates that when operating at optimum 
designed flow rate condition, maximum efficiency of condensate pump is improved at 
a order of 5.5%. 

All the work done in our plant can provide a reference for energy conservation re- 
trofit on condensate pump of 600MW unit. 
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Abstract. Railway clearance detection system based on rotating laser scanner 
sensors was designed for the realization of non-contact detection of railway 
clearance. In the system, in order to acquire and synchronize the data of Laser 
scanner sensors, gyroscope, position transducers and speed sensor, the FPGA- 
based PCI data acquisition synchronization card was designed. This paper de- 
scribes the hardware design for acquisition synchronous card, PCI interface 
chip configuration, FPGA core programming. The experiment proved that using 
the data acquisition synchronization card, 2-channels of 500k baud rate serial 
data, 1-channel of 115200 baud serial data, 4-channel A/D, 2-channel high- 
speed IO signals can be acquired and synchronized, the time accuracy is lms, 
without missing data. This ensures reliable operation of the whole railway 
clearance detection system. 


Keywords: FPGA, Acquisition, Synchronization. 


1 Introduction 


As the rapid development of the high-speed railway and urban railway transport, the 
acceptance of lots of new lines and reconstruction lines needs high speed clearance 
detection device. The traditional clearance detection system is based mainly on 
contact mechanical tentacles, which is low degree of automation and difficult to com- 
pensate the vibration. With the development of photoelectric technology, the laser 
scanner sensor based on light propagation principle is widely used in detection system 
of high-speed railways abroad now. By calculating the time span between the emis- 
sion and reflection of the laser, the scanner measures the distance between the sensor 
and the obstacle. When the angle changes with a certain angular velocity, the laser 
pulse can scan different perspectives on the obstacles to obtain the profile size of the 
current scanning section. 

The system uses two high-speed laser scanners to detect the railway clearance di- 
mension in real-time. The clearance data are stored in database with the railway line 
kilometer as the index, through data processing to get the current position and the 
overloaded line clearance gauge size, easy to railway line construction, renovation 


* This work is partially supported by the State Key Laboratory of Rail Traffic Control and 
Safety (Contract No. RCS2009ZT012), Beijing Jiaotong University. 
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and maintenance. Comparing the historical clearance data of the same railway line, 
we can master the variation trend of the line clearance dimension and exclude the 
hidden danger that affect the railway transportation safety timely, for the safe opera- 
tion of rail traffic is significant. 


2 System Solution 


The overall scheme of the clearance detection system is shown in Fig. 1. 


Laser Scanner 
Sensor >) -—>| 
Lo ~ 
Gyroscope [> signal |»! PCI PCI UP 
ee cs eC Ree 
Position circuils = 
‘Transducer 
Speed Sensor > ad 


Fig. 1. The overall scheme of the clearance detection system 


The LMS200 series high-speed rotation laser scanner sensors made in SICK, a 
company of Germany, are installed directly on the railroad car or dedicated detection 
vehicle to measure the clearance dimension of the line. LMS200 can be achieved 
within the 180° scan, repeated measurement error is less than +5mm, data transfer rate 
up to 500kbps, a minimum sampling interval of 0.5°. When two scanners installed on 
the detection car with an angle of 70°, we can acquire the clearance dimension within 
a range of 290°. 

The laser scanner sensors installed on the detection vehicle will have six degrees of 
freedom uncertain gesture, which will affect the clearance size measurement accura- 
cy, during the running of the vehicle. System uses one gyroscope and four position 
transducers to measure the body's movements gesture in real-time, and compensate 
the clearance data with the rotation matrix method, which will correct the measure 
error caused by vehicle vibration. 

In order to determine the location of exceeding clearance data, the clearance 
dimension data acquired by laser scanner need to correspondence with the line kilo- 
meter, therefore, the system also adds a speed sensor for real-time measurement of 
vehicle speed and location information. 

All the data in the collection must be synchronized, otherwise the dynamic com- 
pensation is unable to carry on and it will affect the measure accuracy of clearance 
dimension. 

Currently, the common method of data acquisition is to use various data acquisi- 
tion card. With this acquisition mode, the sensors’ data are read from each acquisition 
card and the PC time is recorded at the same time, which is used as the time label 
of the current data pack. Each kind of measure data synchronizes by the time label of 
the data pack. While because of the internal cache and the time delay of the data 
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acquisition cards, and the PC time is not precise enough, it is difficult to achieve accu- 
rate data synchronization. 

In order to achieve high accuracy synchronization in data acquisition, a FPGA- 
based PCI data acquisition synchronization card was designed. In this card, the high- 
speed serial port capture, AD acquisition, high-speed IO signal acquisition functions 
are integrated into one FPGA chip, while a sync pulse of 1KHz is generated in the 
FPGA. The count of the real-time synchronization pulse is stored in 4Byte registers, 
and recorded following each packet of data. Using the 4Byte sync pulses count, the 
acquisition time of each data can be known, its accuracy is |ms, with these time tag, 
synchronization can be achieved. 

Large amounts of synchronization data which were collected by the FPGA need to 
be read and stored in PC in high-speed, so the PCI bus interface is used. PCI (Peri- 
pheral Component Interconnect) bus data transfer rate is up to 132MB/s, plug and 
play, and is widely used in various computer systems areas. 


3 Hardware Design of the FPGA-Based PCI Data Acquisition 
Synchronization Card 


PCI data acquisition synchronization card hardware circuits include signal condition- 
ing and data acquisition circuits, FPGA chip, PCI interface circuit and the configura- 
tion chip. 


3.1 Signal Conditioning and Data Acquisition Circuits 


1) Laser Scanner Sensors and Gyroscope: The laser scanner sensors and gyroscope 
output through the RS485 serial port, the communication baud rate of laser scanner 
sensor is 500kbps and the gyroscope sensor is 115200bps. System first uses the 
MAX490 as a level converter chip to convert the RS485 signal into a TTL level sig- 
nal, the signal is isolated through a photo coupler and sent into the IO port of the 
FPGA. 

2) Position Transducer: The output of the position transducer is 4-20mA analog sig- 
nal, it is converted to 0.48-2.4V voltage signal with a 120Q precise resistor. Through 
a capacitor filter and the limited voltage protection circuit, the signal is sent to the 
AD7888. The TTL signals from the SPI port of AD7888 are sent into FPGA's IO port 
after isolation by photo couplers. FPGA can read 12-bit AD data by designing the IP 
core according to the timing of AD7888. 

3) Speed Sensor: Speed sensor outputs two pulse signal, the phase difference is 90°, 
the two pulse signal are sent to the FPGA’s IO port after isolation by photo couplers. 
One signal is used as pulse count and kilometer calculation. According which signal’s 
phase is ahead, we can know whether the vehicle is moving forward or backward. 


3.2. Selection of FPGA Chip 


FPGA chip is the core of the system, the ALTERA Cyclone series FPGA - EP1C12 is 
used in the system, it has 12,060 logic cells, 52 M4KRAM, 239616 total RAM bits, 2 
PLL, 249 pins of available user IO. It’s easy to develop with FPGA, just connect 
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FPGA's I/O ports to external counterparts, and then design the internal logic of the 
system, that’s ok. You can then implement incremental changes and iterate on an 
FPGA design within hours instead of weeks. As system requirements often change 
over time, the cost of making incremental changes to FPGA designs are quite negligi- 
ble. All of the data acquiring programm, synchronous pulse generator, FIFO, PCI 
interface timing and other functions are realized in the FPGA internal, FPGA internal 
functional block diagram shown in Fig. 2. 
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Fig. 2. FPGA internal functional block diagram 


3.3. PCI Interface Chip 


PCI9052 is a common PCI bus interface chip of U.S. PLX company. Using a dedicat- 
ed PCI bus interface chip, complex PCI protocol will not be concerned, we only need 
to develop hardware and drivers, it will reduce the development cycle greatly. The 
PCI9052 is compliant with PCI r2.1, supporting low cost slave adapters, it allow rela- 
tively slow Local Bus designs to achieve 132 MB/s burst transfers on the PCI Bus. Its 
Local Bus clock runs asynchronously to the PCI clock, allowing the Local Bus to run 
at an independent rate from the PCI clock. PCI9052 device contains four local chip 
select signal and five local address spaces, chip select and address space can be confi- 
gured through the EPPROM or host. The PCI9052 supports 8-, 16-, or 32-bit Local 
Buses, which may be Non-Multiplexed or Multiplexed mode. PCI bus interface using 
PCI9052 is shown in Fig. 3. 


3.4 Configuration of the Serial EEPROM 


PCI bus supports three physical spaces: the memory address space, I/O address space 
and configuration space. All the PCI devices must provide configuration space. Sys- 
tem uses a serial EEPROM - 93CS46 to configure PCI9052. The method of using 
EEPROM to configure PCI9052 will be introduced. 
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1) PCI Configuration Registers: Registers of OOH-OFH in EEPROM are used to con- 
figure the PCI configuration registers. Only register of OOH needs to be configured, 
the others use the default settings. Writing configuration data to EEPROM, it should 
be attentive that low front, high in the post. For example, we write 0x10B5 to regis- 
ters of 2H and 3H, 2H = OxBS, 3H = 0x10. 

2) Local address space: Registers of 10H, 12H in EEPROM are used to configure the 
local address space. The hexadecimal data FFFFFFE1 is written to the registers of 
10H, 12H in this system, it means that the local address space 0 maps to PCI-IO 
space, 32-bit PCI addressing mode is used, the local effective address range is 00H 
to 1FH. 

3) Bus Region Descriptors for Local Address Space: Registers of 38H, 3AH in EE- 
PROM are used to configure the bus region descriptors for local address space. The 
hexadecimal data D0118940 is written to the registers of 38H, 3AH, so 8bit PCI bus 
mode is used in this system. 
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Fig. 3. PCI bus interface 


4 FPGA Core Programming 


Most of the system functions are implemented in the FPGA, FPGA program mainly 
completes the sensors’ data acquisition, synchronization pulse generation and count- 
ing, PCI9052 interface control timing and other functions. The design of receiving 
data from serial port is more complex, the following will detail the development of 
the FPGA serial data reception program. 
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4.1 Design of Serial Data Reception 


The data output rate of laser scanner sensor is 500KHz, each packet is 732 bytes, 37.5 
packets per second. In order to receive the serial data accuracy, and add synchronization 
pulse count in the end of each packet of data, the system needs to complete some work 
as the following: 


1) Generate baud rate clock multiplier: Serial data communication is an asynchron- 
ous data transfer mode, carrying out according to a certain baud rate, data receiver 
detects the transfer signal in accordance with the certain baud rate, so as to obtain 
communications data. During this process, except receiving data bit, it is also the need 
to detect start bit and stop bits, shift the data bit, add the synchronization pulse count. 
So we need a clock which is several times of baud rate clock to get more time to 
complete all the work, the system uses 32 times. 
2) Serial to Parallel Converter: Output serial data format of the laser scanner sensor 
is: 1 start bit, 8 data bits, 1 stop bit, no parity bit. All bits in the serial data stream 
should be detected, and 8 serial data bits need to be converted to parallel data. 
3) Add Sync Pulse Count: After receiving a byte of data, the current byte and the 
previous byte on a combination compares with the packet's end flag, if meet, after 
writing the current byte to FIFO, the sync pulse count - 4 bytes are written to the 
FIFO. 

Diagram of serial data reception program is shown on Fig. 4. 
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Fig. 4. Diagram of serial data reception program 


4.2 State Machine 


Serial data receiving process including a total of 7 states, namely IDLE (idle), START 
(start bit detection), SHIFT (character shift), STOP (stop bit detection), STORE (pa- 
rallel data storage), DETECT (data Package end flag detect), WRPULSE (write sync 
pulse count). The data bit count value-COUNTER_BIT, the 32 times the baud rate 
clock count value-COUNTER32, the RXD input value and data Package end flag 
detecting result are combined together as a basis for judging the state machine trans- 
fers. State transition is shown in Fig. 5. 
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Fig. 5. State transition of serial data reception 


5 Experimental Result 


After completing the design work, we had a test in the laboratory by connecting the 
PCI acquisition synchronization card and all the sensors. The data acquisition fre- 
quency of laser scanner sensor is 37.5Hz, gyroscope, position transducer, Speed Sen- 
sor are all 100Hz. Data samples taken within 100ms, when the packet acquisition is 
complete, the synchronous pulse count value (in ms) are shown in Table 1. 


Table 1. The Synchronous Pulse Count Value 


Packet Speed Position Laser Scanner _ Laser Scanner 


Gyroscope 


Num Sensor Transducer Sensor | Sensor 2 
1 151815 151815 151812 151819 151821 
2 151825 151825 151822 151845 151848 
3 151835 151835 151832 151872 151875 
4 151845 151845 151842 151899 151901 
5 151855 151855 151852 
6 151865 151865 151862 
7 151875 151875 151872 
8 151885 151885 151882 
9 151895 151895 151892 
10 151905 151905 151902 


In this system, the time sampling interval of laser scanner sensor is 26.6ms, the 
others are 10ms. Among them, the laser scanner sensors and gyroscope are actively 
sending data, so we cannot control the time of generating data for acquisition syn- 
chronization. For this reason, the synchronous pulse count value is added at the time 
of completing data acquisition to get the moment of data acquisition, the time accura- 
cy is Ims. By this time label, look for the closest of all data packets, put all this data 
packets as one group data to process and synthesize, and then to achieve effective data 
synchronization. 

Table 1 shows, speed sensor and position transducer data acquisition can be syn- 
chronized by control. The time of Gyroscope’s data generation is at random, but the 
time interval is invariably. The time sampling interval of Laser scanner sensor is 
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26.6ms, need to find matching data by algorithm. In the system, if the time difference 
of collection between the Laser scanner sensor and Speed Sensor less than or equal to 
Sms, we think them as a group of data. Such as Table 1, the third data packet of Laser 
scanner sensor and the seventh packet data of others is one group data. 

The Railway Clearance detection System with the FPGA-based PCI data acquisi- 
tion synchronization card is used on the Road-Rail Amphibious Monitoring Vehicle 
developed at Beijing Jiaotong University. The boundary dimensions of Yizhuang 
Beijing metro line depot was detected with the vehicle in the acceptance check of the 
new subway line. After synthesizing the data collecting from the area with ladders, 
the result shown as Fig. 6. The semi-closed area in the middle of the figure is a border 
of clearances, on the right-hand is the ladder, the size of the ladder is out of range. 
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Fig. 6. The boundary dimensions of Yizhuang metro line depot 


6 Conclusions 


Two channels 500k baud rate serial data, one channel 115200 baud serial data, four 
channels A/D, two channels high-speed IO signals can be acquired and synchronized, 
in the FPGA-based PCI data acquisition synchronization card of railway clearance 
detection system, time synchronization accuracy is Ims, no packet loss. When the 
monitoring vehicle runs at the speed of 5Km/h, the interval of each profile clearances 
is 3.7cm. The line position can be matched with the clearance data by data processing, 
when there is exceeding clearance, according to the corresponding kilometer mark, 
we can quickly find the location of overrun points. The FPGA-based PCI data acqui- 
sition synchronization card is used on the Road-Rail Amphibious Monitoring Vehicle 
developed at Beijing Jiaotong University, the clearance of railway and subway line 
can be acquire in real-time. Now the vehicle has been used in the acceptance test of 
Beijing new subway line, through the use of this vehicle, we can monitor the new 
railway line quickly, find the position which out of clearances, facilitate the construc- 
tion rectification. This system also provides a new method for the urban rail transport 
monitoring in the future. 
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Abstract. In this paper, the electrical transient mode of induction generators is 
studied, establishing the short-circuit current sequence component of wind ge- 
nerators are when faulty occuring in collect feeder, and analyzed the major ef- 
fective factors that influence short-circuit characteristics such as wind turbine 
types, fault type, fault resistance and the number of generators participating ect. 
Take a 49.5MW wind farm as an example in North China, the dynamic models 
with asynchronous generators and double-fed induction generators are establis- 
hed based on PSCAD/EMTDC, the fault characteristics of two types wind 
generators are thoroughly simulation analyzed, the results of simulation are 
identical with theoretical analysis. The simulation results show that these in- 
fluence factors should be considered when we design the relay protection of wind 
farm, it is helpful to enhance the tripped performance of relay protection. 


Index Terms: asynchronous generators, double-fed induction generators, 
short-circuit characteristics, Simulation analysis. 


1 Induction 


Owing to wind power play an important role in sovling environmental problem and 
coping with enegy crisis, so the installed wind turbines is rapidly increasing worldwide. 
with the fast development of wind power, the impact of wind generator on power 
system become an important reseacher topic[1], however, most study focus on transient 
stability and power quality, very little attention has been given to analyze the fault 
characteristics of wind turbine generators and received less achievement. 

In the past, the relay protection of wind power system neglect the short-circuit cur- 
rent contrbution of wind farm[2-3]. With the increase of capacity of wind power in 
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power system every year, short-circuit current of wind turbine contribution is also 
growing, when the wind power capacity reach to a certain extant, short-circuit current 
of wind farm will be higher than short-circuit current of grid, and the protection of wind 
power system must consider the short-circuit current contribution of wind turbine 
generators, and fault characteristics is foundation of relay protection setting, conse- 
quently, research on the short-circuit characteristics of wind farm will be significant. 

Wind generator power is different form conventional thermal, hydro power and 
nuclear generation, which arise different analysis methods regarding the short-circuit 
characteristics and relay protection of wind farm. Also, reseach regarding wind farm 
short-circuit characteristics and relay protection are still limited as follows: The trip 
boundary of distance protection should be setted adaptively that take into account the 
number of generators, loading level and system frequence[4]. The short-circuit cha- 
racteristic of asynchronous wind generators was analyzed from fault type and varying 
wind speed [5-7]. The primary objective of this paper is to simulation analysis the 
short-circuit characteristics of squirrel cage induction generators and double-fed 
induction generators, consider the main effctive factors that influence fault characte- 
ristics, establishing the dynamic simulation model using PSCAD/EMTDC, the con- 
clusions could applied on the current instantaneous protection. A 49.5MW wind farm 
in North-China is considered as a simulation example in this reseach. 

The paper is organsied as follows. The three main types of wind turbine are intro- 
duced in section 2, section 3 analyzed the mathematical model of wind turbine gene- 
rators,the description of the North-China wind farm that used as simulation example in 
section 4. section 5 highlights the simulation results in different influence factors. 
Finally, section 6 summarized the conclusion and sppendix that used in this paper. 
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Fig. 1. Three main types of wind turbine 


Short-Circuit Characteristics 27 


2 Wind Turbine 


Nowdays, three main types of wind turbine are commonly being installed in china. The 
fixed speed wind turbine with squirrel cage asynchronous induction generator directly 
connect to grid(Fig.1 a), and varible-speed constant frequence wind turbines with 
wound induction generator and a converter on rotor circuit known as doube-fed in- 
duction generators, Fig.1c show the variable speed wind turbine, with the generator 
connect to the grid through a full power converter in the stator circuit. 


3 Mathematical Model 


This section describes several dynamic mathematical models of wind turbines and 
induction generators. 


3.1 Wind Tubine 


The relation between the wind speed and aerodynamic torque may be described by the 
following equation[8-9]: 


V, 2 

3 

wns ae (1) 
Where TM is the aerodynamic torque extracted form wind[Nm], p the air densi- 
ty[kg/m3], R the wind turbine rotor radius[m], Vy the equivalent wind speed, B the 
pitch angle of wind turbine[deg], Cp the aerodynamic efficiency, the tip speed ratio 1: 


1 
Ty meee 


Qa Pink (2) 
Vw 


here @, , is the rotational speed of wind turbine [rad/s]. 
Numerical approximations have been developed to calculate Cp for give values of B 
and i, here the following approximation is used: 


-12.5 


C, = 0.22(—° 0.4B-5.0)e 7 (3) 


The dynamic operation of the induction generators is governed by the swing equation 
given belowp [10]: 


pi? or, -7, (4) 
dt 


Where T,, mechanical torque applied on the rotor of the associated wind turbine(Nm), J 
the wind turbine mechanical inertia(kg.m’), o the rotor speed of generator (rad/s), T. is 
generator electro-magnetic torque(Nm). 

In (4), T, is electro-magnetic torque developed in the induction generator at the gi- 
ven speed is in proportion with the square of the terminal voltage as follow: 
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T, = KsU” (5) 


where K is constant value depending on the parameters of the machine and s is the 
machine slip. 


3.2 Induction Generator Model 


Fig.1 show the one line diagram of the implemented d-q equivalent circuit of induction 
generators[11-13]: 
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(b) d-axis equivalent circuit 


Fig. 2. d-q axis equivalent circuit 


Detailed explanation of this equivalent circuit was clearly describled as follow: 
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where ug, and u,, represent thestator voltage, uy,and uy, the rotor voltage, wa; and Was 
represent the stator flux linkage , wa, and y,, the rotor flux linkage, ig,and i,,, the stator 
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current, idr. i,, the rotor current, ws the synchronous speed, the parameters of the 


machine L,, r,, L,, r,, and L,, represent the stator reactance, stator resistance, rotor 
reactance, rotor resistance, and mutual reactance respectively, s is the rotor slip. 


3.3 Influence Factors of Short-Circuit 


Fig.3 show the line diagram of the collect circuit for single phase-to-ground fault. 
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Fig. 3. Line diagram for single phase-to-ground fault 


In power system, when occuring the fault, the fault current sequence component can 
be written as follow: 
po Fis (8) 
1 n 
Zi3+Z,” 
For the single phase-to-ground fault, the sequence currents component of wind gene- 
rators can be written: 
Lr=L,° =I" 


Ey aly Lay 
Ly + Ly tLoyytZig tZog tZyg +3Rp 


(9) 


where 


Ziw=ZiswtZiw .ZiG=ZisctZine 3 
Z2w=ZLoswt LowZ2G=L2sGt L216 ; 
Zow=Zoswt Zo.w,Loc=Zosct Zoe: 


where Iy the prefault current, Ew the prefault voltage of wind generators, Z)sy, Zosw 
and Zosw the positive, negative and zero sequence impedances of wind generators, and 
Zitw, ZoLw and Zozw the positive, negative and zero line sequence impedance from bus 
W to fault location, Z)s¢, Z2sg and Zgsg the positive, negative and zero sequence im- 
pedances of system, Z);¢, Z,¢ and Zo,g the positive, negative and zero line sequence 
impedance from bus G to fault location. 

When occur two phase fault and three phase fault, short-circuit current coule be 
describled in(8), but the difference among three fault type is the Z,. 

Due to the nature stochastic and intermittent of wind speed, wind turbine always 
withdrawal and even working in motor sometime, so the number of generator partici- 
pating will change and the quivalent impedance of generators will change. We can 
derive the major effective factors that influence short-circuit characteristics from(9) 
such as wind turbine types, fault type, fault resistor and the number of generator par- 
ticipating in a time. 
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4 Wind Farm Model 


The aimed study is carried out a wind farm in north china as simulation examlpe. The 
farm was structured with three collect circuits and all capacity is 49.5 MW, the collect 
circuit 320 contains 20 wind enerators( 750kW ) providing a total power of 1SMW, the 
wind generators use 800kVA, the collect circuit 322 contains 10 wind generators 
(1500kW double fed induction generators) provide a total power of 1SMW,the wind 
generators use 1600kVA tranfromers to set-up from the 0.69kV rated voltage to the 
35kV collect circuit voltage, wind farm collector line length 5km, the collect circuit 
connected with wind farm substation to set-up voltage 110kV and connected 16km 
lines to 110 grid hebei province. 


3a An uo 


| jus Aiie 


-—o—o es Tanne Siri 


K—o—s—f-_ kl 


(a) collect circuit 320 


Double-bed 
indnctiompenaranor 
304 


SSRW7LIAY TO 
nl = 
GF LSI (> ] 4 v7 
Traut Siri 
cine 


(b) collect circuit 322 


Fig. 4. Model diagram of the simulation system 


5 Simulation Results 


5.1 Wind Turbine Type 


We assume that the active power of squirrel-cage induction generator and double-fed 
induction generator is similar. Figure.5 Show the simulation results of three 
phase-to-ground fault occuring in the place of 2.5km distance k1 and k2 away from the 
collect circuit 320 and 322, In this case, we operated the wind all generators with 
maximum active power output. 

The simulation result show that the circuit current instanours value of squirrel cage 
induction generator and double-fed induction generator is 4 to 5 times rated 
current value, because squirrel cage induction generator do not have independent 
field windings to develop the required electro-magnetic field in the air gap of the 
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machine, therefore, when the three phase-to-ground happen, the terminal voltage of 
generator have decreased and the grid can not continue to provide exciting for wind 
turbine, The asynchronous short circuit current contribution drops from the initial value 
to zero in a few cycles. however DFIG with wound rotor induction generator, the 
short-circuit current gradually decay and due to the converter of DFIG, when occuring 
fault in grid, DFIG could supply continuous short-circuit current. The current insta- 
nours protection is infulenced by short-circuit current instanours value, when setting 
the relay protection of wind farm, we should consider the short-circuit current of dif- 
ferent wind turbine, Current protection II is infulenced by trip time, so we should 
consider attenuation characteristic of wind generators. 
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(b) three-phase-to-ground fault current of double-fed induction generator 


Fig. 5. Short-circuit current of Wind turbines response to ABC-G Fault 


5.2 Fault Type 


With A-B fault happening occuring in the place of 2.5km distance k1 and k2 away from 
the collect circuit 320 and 322, In this case, we operated the wind all generators with 
maximum active power output. The resulting two phase fault short-current current as 
shown in fig.6. Due to the c-phase without fault,network can continue provide excita- 
tion to the wind turbines from c-phase, therefore, asynchronous generators and 
double-fed induction generator can provide continuous short-circuit current when 
two-phase fault. 
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(b) short-circuit current of double-fed induction generators 


Fig. 6. Short-circuit current of Wind turbines response to AB Fault 


5.3. The Number of Wind Generators 


Unlike conventional generation with thermal or hydro, wind farm generators are smaller 
in size, and a group of them collectively harness bulk power from a large area. The 
number of wind generators at a time will vary due to withdrawal of generator at high or 
low wind speed, the equivalent impedeance of the wind farm will change depending on 
the number of wind generators connected to the collect circuit at a time. Three situation 
are simulation where the sum capacity (wind generators participating at an instant) is 
15MW, and in case2 the sum capacity is 12MW, the capacity is 9MW in case3. The 
short-circuit current curves of wind generator are described in Fig.7. 


Fig. 7. Short-circuit current in different number of Asynchronous wind generators 
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The impact on short-circuit current is very obvious as result of different number of 
wind generators. The simulation results as shown in Fig.7, It is clear that the 
short-circuit current will increasing when the number of wind generator increasing, 
because the number of wind generator increasing, the equivalent impedeance of wind 
generators will descreaing, and according to the formula (8-9), the short-circuit current 
will increasing, therefore the current protection need to be updated in accordance with 
the number of wind generators participating in the wind fatm. 


5.4 Fault Resistance 


Fig.8 show the short-circuit current of wind turbine in different fault resistance. The 
simulation results show that due to the value of Rr is zero,causes the terminal voltage of 
generators drop to zero, so the short circuit current contribution drops from the initial 
value to zero in a few cycles; however,the value of Rp is 4Q, the terminal voltage of 
generators drops to 0.2pu, so the wind generators can contribute continuous 
short-circuit current. 


Fig. 8. Currents of wind generators response to differnt fault resistance at position k1 


6 Conclusion 


The objective of this paper aimed to investigate the short-circuit characteristics of 
asynchronous generators and double-fed induction generators, and dynamic model was 
established in PSCAD / EMTDC. the results showed that: when ocurring 
three-phase-ground fault, the asynchronous short circuit contribution drops from the 
initial value to zero and DFIG could contribute continuous short-circuit current; 
two types wind turbines can provide continuous short-circuit current when two-phase 
fault happening; and the short-circuit current will increasing when the number of wind 
generator increasing. With the capacity of wind power increasing in power system, we 
need to consider the short-circuit current contribine of wind farm when we setting the 
relay protection of wind farm. The simulation results show that the major factors in- 
fluence the short-circuit characteristic are different wind turbine, fault type and the 
number of generator operating in a time, we should utilize the adaptive protection 
method to set the current relay protection using these influence factors. 
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Appendix 


Table 1. 110kV Line Parameters 


Value 


Positive sequence resistance 0.1320/km 
Positive sequence reactance 0.4010/km 
Positive sequence susceptance 2.85x10-6s/km 
Zero sequence resistance 0.396O/km 
Zero sequence reactance 1.2030/km 
Zero sequence susceptance - 
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Table 2. 35kV Line Parameters 


Value 
Positive sequence resistance 0.1320/km 
Positive sequence reactance 0.3570/km 
Positive sequence susceptance 3.21x10-6s/km 
Zero sequence resistance 0.396O/km 
Zero sequence reactance 1.071Q/km 


Zero sequence susceptance 


Table 3. 110/35 kV Transformer Parameters 


Units Value 
Rated power MVA 50 
Ratio kV 110/35 
Connection -- YND11 
Leakage reactance Pu 0.1 


Table 4. 35/0.69 kV Transformer Parameters 


Units Value 
Rated power MVA 1.6 
Ratio kV 35/0.69 
Connection -- Dyn11 
Leakage reactance Pu 0.01 


Table 5. 750kW squirrel-cage induction generator Parameters 


Value 
Stator resistance Rs 0.0053pu 
Stator unsatured leakage reactance Xs 0.1060pu 
Unsatured magnetizing reactance Xm 4.0209pu 
Rotor unsatured reactance Rr 0.0070pu 
Rotor resistance Xr 0.1256pu 


Table 6. 1500kW double-fed induction generator parameters 


Stator resistance 
Stator unsatured leakage reactance 
Unsatured magnetizing reactance 
Rotor unsatured reactance 
Rotor resistance 
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Optimization Design of Lifetime Distribution in Power 
Diode with Fast and Soft Recovery 
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Abstract. The fast and soft recovery is required for power pin diodes support- 
ing the use of IGBT. In this paper, the effects of the local low lifetime and the 
overall lifetime on device characteristics have been simulated and analyzed by 
using of software. Based on their effects on the reverse recovery time t,,, the 
forward voltage drop Vp and the reverse leakage current Ip of power pin diode, 
those parameters, such as the base location L, lifetime Tp and allover life- 
time Tp In local low lifetime control technology and the overall lifetime tp, are 
studied and the optimal base lifetime distribution parameters are obtained. 
Those results have practical reference value for research and manufacturing of 
diodes with fast and soft recovery. 


Keywords: local low lifetime, fast and soft recovery, reverse recovery time. 


1 Introduction 


With the Power Electronics towards high-power, high-frequency-based and modular 
development, the development of the indispensable fast soft-recovery diodes (FRD), 
matching with the IGBT and power MOSFET high-frequency electronic devices, is 
urgent and has important practical significance. In recent years, a new lifetime control 
technology (Local lifetime control technology) has attracts more attention[1]. To form 
a sheet local low-lifetime region with certain thickness, the high-density recombina- 
tion centers are introduced on the position with a very small distance to the pn junc- 
tion. The position is perpendicular to the direction of the pn junction plane. Local low 
lifetime regions have great impact on performance parameters of FRD and are advan- 
tage to realizing the trade-off among of those parameters. 


2 PIN Power Diode Basic Structure and Working Principle 
2.1 The Basic Structure 


The structure of PIN diode is composed of p* region, N , region and I region. P*tre- 


gion and N“ region are high doping concentration. I region is N-type high-resistivity 
material and its doping concentration is very low. The basic structural model and the 
impurities distribution were shown in Figure | (a) and Figure | (b), Shaded area is the 
local low lifetime region. 
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(a) (b) 


Fig. 1. The basic structure and impurity distribution of PIN diode 


2.2 Theoretical Analysis 


When the P-I-N power diode is added forward voltage, its forward voltage drop is 
low. When the diode is added reverse voltage, its reverse voltage is high[2]. When the 
voltage added on power diode changes from forward voltage to reverse voltage, the 
diode can not be cut-off immediately[3][4]. The reason is that many carriers stored in 
I region can not disappear immediately. It needs period of time that these carriers are 
extracted or recombined completely and this is the reverse recovery time t,,. In order 
to protect the device in the main circuit, the diode reverse recovery time must be very 
short. The reverse recovery characteristics of diodes are shown in Figure 2. 
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Fig. 2. Reverse recovery characteristics of diodes Fig. 3. Test circuit of reverse recovery time 


The reverse recovery time of diode is consisted of two parts[5][6]. One is storage 
time t, and the other is recombination time t,. The diode is added reverse voltage Vp 
at t = t;. Time period from ty to t, is t,. Time period from t, to t, is recombination time 
t,. The reverse-recovery softness factor S is described by S = t,/t,. Namely, the ratio 
of recombination time and storage time 

Based on the study of PIN lifetime control technology of power diode, it can be 
found that the reverse recovery time of diode can be improved significantly, while its 
forward voltage drop and reverse leakage current are almost no changed, as shown in 
Figure | (b), by introducing high-density recombination centers in I region. 
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3 Simulation Analysis 


3.1 Model Building 


The purpose of this paper is design the diodes supporting the use of IGBT and its 
reverse voltage is 1200V. Its technical parameters are shown in Table 1. The structure 
of the device is shown in Figure | (a). The specific structural parameters as follows, 
the total epitaxial wafer thickness H is 150 4m, epitaxial thickness h is 30 dm 


(thickness of N” region), the concentration yt 8 5x10" / cm? (concentration of 
N region), the total area is 9000 9000 sum? , P” area is 86008600 um" ; 


depth of P* is 304m, the concentration of P* region n is 5X10° / un , the 


thickness of N region W is 90 4m, the concentration of N’ region n,- is 


1x10'*/ 4m’, the width of Field Limiting Ring are 15 4m, 17 £m and 19 lm, 
the rings spacing are 50 4m, 48 Lim and 46 Lim, respectively. 
The test circuit of reverse recovery time is shown in Figure 3. Simulated conditions 


are that forward current I, equals to 150A, reverse voltage V p is 600V and reverse 


current fall rate di/dt is 1650 A/ Zs [6]. 


Table 1. Main technical parameters 


Parameters Qualification U Notes 
nits 
Rated Forward Average Current I; 150 A 
Repeat peak reverse voltage Vp 1200 Vv 
Reverse leakage current Ip <27 HA | Reverse Voltage Vp=1200V 
Forward voltage drop Vr <1.6 Vv Forward Currentl,=150A 
Reverse recovery current Ippm <187 A di/dt=1650A/ Ls 
Reverse Recovery Time t,,; <400 ns 


3.2 Results and Discussion 


3.2.1 Simulation and Analysis of Location of Low-Lifetime Region 


Low-lifetime region width W, lifetime T _ and the overall lifetime of the device T, 


are 10 4m, 1X10°*s and 3X10°s, respectively. The value of other parameters is 


given in the previous models. The effects of the low lifetime position L on the recov- 
ery time t,, is simulated by changing the position of L from 30 4m to 120 44m (mov- 


ing low lifetime region from near the P* area to the N” area, each additional 10 zm). 
The simulation results are shown in Figure 4. 
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Fig. 4. The effect of the location L on the Fig. 5. The effect of the location L on the 


reverse recovery time trr forward voltage drop V,. 


With the increasing of the location L from 30 4m to 60 {im the reverse recovery 
time t,, decreases from 450ns to 380ns. When the location L is 60 ém( center of I 


region), the reverse recovery time t,, is minimum. With the increasing of the location 
L from 60 44m to 120 4m, the reverse recovery time t,, increases from 380ns to 


430ns. When the diodes are cut-off, holes and electrons near the p* region and N " 
region can be extracted rapidly, while the carriers in the middle of I region can not be 
extracted but recombined, which needs a long time. Therefore, the reverse recovery 
time t,, is longer. When the location L is moved from 30 4m to 60 4m, the recombi- 


nation contribution is more notable and the reverse recovery time decreases. When the 
location L is far from the center of I region (L>60 //m ), the recombination contribu- 
tion is not remarkable and the reverse recovery time increases. When the location L is 
in the center of I region (L = 60 £/m), large number of holes and electrons without 
extracting from I region are recombined through low lifetime region and the reverse 
recovery time shortest. 

It can be seen that, from Figure 5, the forward voltage drop V , is almost not af- 


fected by the location L of low-lifetime and the value is only 0.007V. In the local 
low-lifetime control technology, the thickness of high-density recombination centers 
is very thin so its impact on forward voltage drop is not notable. In addition, the effect 
on leakage current is so small that it can be ignored. 


3.2.2 Simulation and Analysis of Local Minority Carrier Lifetime 
The location of the low lifetime region L is 60 /m and other parameters is un- 
changed. Minority carrier lifetime is in low lifetime region is changed from 


1x10 s to 1x10°’s. The effects of different minority carrier lifetime on recovery 
time t,, and forward voltage drop have been simulated and analyzed. The results are 
shown in Figure6, Figure7 and Figure8. 
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Fig. 6. The effect of the lifetime T Z on the Fig. 7. The effect of the lifetime 7 : on the 


reverse recovery time t,, forward voltage drop Vr 


Figure 6 shows that the reverse recovery time t,, is reduced from 420ns to 340ns 
with the lifetime 7; being decreased from 1x10~s to 1X10°’s. When lifetime is 


less than 1X10~’s, th of the forward voltage drop V p is only 0.009V. The effect of 
lifetime on the forward voltage drop is very little and the local low-lifetime iz is 


identified as 1X10°°s. 


3.2.3. Simulation and Analysis of the Overall Minority Carrier Lifetime 
The location of the low lifetime region L is 60 2m, the lifetime of local low lifetime 


region T is 1X10™°s and other parameters are unchanged. The effects of the overall 


minority carrier lifetime on the forward voltage drop Vz, the reverse recovery time t,, 
and the reverse leakage current Ip have been simulated. When the lifetime is de- 


creased from 1X10~s to 1X10~’s, The forward voltage drop V is increased from 
0.930V to 0.931V and the change is not obvious as shown in Figure 8. When lifetime 
Tp is reduced from 1x10 s to 1X10°°s, the forward voltage drop V y is increased 
from 0.931V to 0.946V and the increase speed is rapidly. When lifetime 7, is re- 


duced to 1x10° s, the forward voltage drop V ,, is saturated.e reverse recovery time 


t, is reduced with a slower speed. When lifetime is reduced to 1X10°*s, t, is satu- 


rated and no longer decreased with the reducing of T 2 When lifetime is less than 


1x10° s, the minority carriers in the center of I region have been able to recombine 
completely and the lifetime of minority carriers is no longer the major factor deter- 


mining t,,. Figure7 shows that when the lifetime le is changed from 1x10~s to 


1x10~’ s, the variation 
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Fig. 11. The effect of the lifetime Tp on the 


reverse leakage current [ R 


From Figure9, Figure10 and Figure 11, it can be seen that the reverse recovery time 


t,, is decreased with the decline of minority carrier lifetime T, and the decrease of the 


reverse recovery time is sharp. When the 7, is declined to 1x107 s, the reverse 


recovery time t,, is saturated. The leakage current I , is increased rapidly with the 


decline of minority carrier lifetime 7, . When the lifetime 7, is declined tol X 107 s, 


the leakage current I , continues increase. Considering the relationship between the 


forward voltage drop V,. and the leakage current Ip, the overall minority carrier life- 


time is is identified as 1X10°’s. 


In conclusion, based on the comprehensive consideration of the forward voltage 
drop V,,, the reverse leakage current /, and the reverse recovery time t,,, the optimal 


parameters of diode are as follows, (1) the location of low lifetime region L is 60um, 
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(2)the minority carrier lifetime of low lifetime region . is 10ns, (3) the minority 
carrier lifetime of allover region T, is 100ns. Based on those optimal parameters, the 
simulation results indicate that the forward voltage drop V, is 0.935V, the reverse 


leakage current /, is 1.1pA and the reverse recovery time t,, is 334ns. Various per- 


formance parameters achieve design requirements. 


4 Conclusion 


In this paper, the effects of low-lifetime region location, minority carrier lifetime and 
the overall lifetime on forward voltage drop Vp, reverse leakage current Ip and reverse 
recovery time t,, are simulated comprehensively and systematically. The optimal pa- 
rameters are obtained based on those simulation results. The results indicate that, the 
reverse recovery time can be reduced effectively by introducing local low-life region 
in the base region of p*n'n* fast soft-recovery power diode. The reverse recovery time 
of diodes is related to the minority carrier lifetime of low-life region of, while the 
leakage current and forward voltage drop are almost not affected. The reverse recov- 
ery time is also related to the position of low-lifetime region. When the low-lifetime 
region locates in the center of the base region, the reverse recovery time is minimal. 
Those results provide the instructional methods and conclusions for the optimum 
design of base region lifetime distribution in fast soft-recovery power diode. 
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Abstract. The paper introduces the n=W;/X,,=0.25 mathematical models and 
adopts mathematical model Vp=94p,°7 to design structure parameters of high 
power double-base P*PINN* structured fast soft recovery diodes. Platinum dop- 
ing and electron irradiating technologies are used to mutually control the base 
minority carrier lifetime and distrbution, the design method is used to optimize 
the structure parameter of ZKR300A/ 2500V. The design parameters were 
tested and verified through experiments, which proved the parameters of diodes 
meet the designed target and achieved the level of similar products in abroad 
countries. The results prove that the design method and the selected parameters 
are correct,lifetime control is effective. 


Keywords: FRD, P*PINN* structure, soft recovery, minority carrier life. 


1 Introduction 


At present, new high-power electronic devices such as GTO, IGBT, IEGT, IGCT are 
widely applied in power electronic field. FRD is indispensable as important “partner” 
chip must meet application requirements. Because the reverse characteristic of ordi- 
nary quick recovery diodes is “hard”, it is easy to damage power electronic devices. 
We adopt P*PINN* in replace of ordinary PN and P*IN’, it has large forward current 
Ipy.high voltage Uprm, low-state voltage drop Vpy, and by controlling the base region 
of minority carrier’s life and impurities distribution accurately,it can decrease reverse 
recovery charge Q,,, reverse recovery time t,, and soft characteristic S. The research of 
the device is very important for promoting the development of high-frequency power 
electronic circuits[1][2][3]. 


2 Basic Structure and Principles 


2.1 The Basic Structure 


P*PINN* diode is the combination of a PN junction and two high-low junctions (P*P 
and NN*) which is made of high P* and N* layer, N and P layer of subsurface on N" 
type semiconductor substrate. P * region is the anode, N* region is the cathode, the 
base region I is made of light doping N and higher doping N region, and N region is 
the buffer base region or buffer layer. The structure is shown in Figure 1. 
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Fig. 1. Double-Base FRD 


2.2 Theoretical Analysis 


The structures and electric field distributions of P*PINN* fast soft-recovery diode, 
ordinary rectifier diode of PN and fast rectifier diode of P*IN* are shown in figure 2 
(a),(b),(c). P*PINN* owns both the advantages of P*IN*and PN. When reversing, 
owing to base region I, the space charge expand fully leading to increasing the break- 
down voltage Ura. Because of N region, there is a restriction on space charge when 
it expands into N region. It not only meets the breakdown voltage, but also reduces 
the depth of region I. While the device is forward turned on, the depth of region I is 
reduced. Since P* and N” areas inject carriers into the I area, an increase of carrier 
concentration I area, the role of enhanced conductivity modulation to the low forward 
voltage drop Vs; because the depth of region I is reduced, the storage of charge is 
decreased, thus, reverse recovery time t,, is shorter. When the device is turned off, 
charge in region N are complex and disappeared, which makes the complex time 
longer, and the recovery characteristic is softened. 
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Fig. 2. Diode structure and the electric field distribution 


When on-state diode applied a reserve voltage suddenly, a lot of minority carriers 
are stored in pn junction, it will takes some time to compound those minority carriers 
until the diode is cut off, which is called the reverse recovery, the time is called re- 
verse recovery time t,,, it contains two processes of storage and recovery. After the 
time of storage t,,the minority carriers in charge space are extracted, and reverse cur- 
rent has reached the maximum value I,,.When t > t,, space charge begins to form. 
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Because the minority carriers are compounded and disappeared, reverse recovery 
current begins falling to zero gradually, the diode recover to reverse state and can bear 
high reverse voltage, this time is called recovery time t). During ty, if the reverse re- 
covery current falls quickly, owing to line inductance, there will be a high burr vol- 
tage, even a strong shock, which will interfere the circuit. As a good FRD, it has short 
t,,, lower -di/dt, and a low burr voltage. Generally, the soft coefficient S= t,/ t; is used 
to indicate the softness of reverse recovery characteristic. Bigger the ratio is, better 
the soft characteristic is. In the same circuit condition, lower the overshoot voltage is, 
smaller the ratio is, and worse the softness is. The reverse recovery characteristic of 
diode is shown in figure 3. 


Fig. 3. Diode reverse recovery characteristics 


3 Design of Structural Parameters and Technology 


This paper adopts high power ZKR round core ZKR300A/2500V as an example, 
calculating and selecting the structural parameters and process conditions. 


3.1 Structural Parameters 


1) the main technical parameters of High power ZKR round core 


Table 1. Main technical parameters 


Parameters Notes 
Rate average forward current IF 300/500 oe 


Reverse repetitive peak voltage Verm 
Reverse non-repetitive peak voltage 
Reverse repetitive peak current Iprm 
Forward on-state voltage Vey 
Forward instantaneous voltage V¢, d; /d,=S500A/uUs 
Reverse recovery time t,, —d; /d,=100 A/ts 


Reverse recovery charge Q,, 


Reverse recovery softness S(tp/ t,) 
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2) Structural parameters 


(1) Vertical design 
The empirical formula V, = 94 p, is used in the design, the utilization factor 


n= or is introduced to optimize design ideas. Both of all make sure the Vz is con- 


stant, and make the intrinsic region thinner, thus, the high current characteristic of the 
high-power fast soft recovery diodes under high voltage and frequency is increased, 
and switching characteristics are good. 

According to the formula of avalanche breakdown voltage[4]: 


V, =94p," (1) 
Vz:the avalanche breakdown voltage; p, : the base resistance. 


According to the empirical formula of the max width of region and base region 
resistivity: 


7=% ? 
X,, =4.99p,"". 3) 
Va= Vaol(2n~1) - (4) 


W,: the thickness of based region; X,, : the space charge region broadening maximum 
width; 1 : the ratio of coefficients. 

First of all, according to formula (2), n = 0.25 ~ 0.5. while making sure Vp con- 
stant, we can adopt the thinner intrinsic region and select the smallest coefficient n, 
which can increase the switch characteristics and the high-current characteristics of 
high-power fast soft recovery diode when they are under the high frequency and vol- 
tage. From formula (4) and (2), p, ~ 190 ~ 400 Q:cm, and according to the formula 
(3), Xm = 431 ~ 812 um,so, Wy = 7Xp, = 203 ~ 215 um ~ 220 um; then, adopting W, = 
70 um, W, = 60 um, W, '= 120 um, finally, silicon thickness d = W, + W, + W, + W,' 
+ AW = 500 um. 

In the premise of the lower forward voltage, we make the P* and N* region thinner, 
reduce the recombination effects of the P* and N* storage carriers, and ensure the 
surface diffusion concentration : 1.0~10x107°/cm”. 


(2) Horizontal structure design 


The diameter of silicon wafers are selected to determine the width and angle of the 
angle lap according to the current capacity rating and voltage rating. 

According to the empirical formula of the on-state current and on-state current 
density: 


i= 77D", . (3) 


Jp : the forward on-state current density, D, : the chip diameter. 
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According to the empirical parameters, on-state current density generally range 50 
~ 120A/cm’, select Jz = 110 A/cm’. Experience (5)-style get-chip 2.4 cm in diameter. 
Because of dual-mill grinding angle, rubbing off the width of 3 mm, we adopt the 
chip diameter @30mm. 

Above all, taking the on-state power loss, heat dissipation, the process, cost pro- 
duction and other factors into account, we take the optimization parameter : p, : 200 ~ 
400Q:cm; W; : 220um; W,: 70Oum; W, : 60pm; total thickness of silicon wafers 
d= 500 um; wafer diameter 30mm. 


3.2 Controlling Theory of Impurity Concentration Distribution 


It can improve the reverse recovery characteristics, through controlling the distribu- 
tion of the impurities. The traditional P*IN* diode's thickness is thick,and life is short, 
so base storage carriers can diffuse into P* and N* region and complex in the recovery 
time. If the base region is not too thick and the minority carrier life is longer, it is 
good for reverse recovery. If life is short, it will accelerate the reverse recovery, re- 
sulting in hard recovery characteristics. So it could add a longer life and lower con- 
centration P and N region, separately, between P* region and I, N* region and I. In the 
premise of enabling P* and N* region thinner,and reduce the base storage carrier 
combination effects of the P* and N’, it is good for soft recovery characteristics and 
can ensure the forward voltage drop still lower. 


3.3 Minority Carrier Lifetime and Distribution of Reasonable Control 


Since changes in the minority carrier lifetime and minority carrier lifetime control 
method directly affect the fast and soft recovery diode reverse recovery switching 
frequency characteristics and softness, the minority carrier lifetime control technology 
is very important. The so-called minority carrier lifetime control methods that use 
different techniques to increase the base area of the recombination centers, such as 
increased use of deep level impurities (expanded gold, platinum, etc.) or have some 
defect levels (electron irradiation) to form a composite center. Thereby increasing the 
non-equilibrium carrier recombination rate, reducing the base minority carrier lifetime 
and to the base minority carrier lifetime in different regions in different areas, so as to 
minimize off-time, increase the recovery soft, increase switching speed. 

No matter adopting the expansion of gold, platinum or electron irradiation, which 
have their own advantages and disadvantages, will reduce the lifetime of the base 
region minority carrier greatly. High-energy is used at room temperature or low tem- 
perature after the end of the chip processing, which is simple, precise control and 
consistent.The platinum expansion technique is combined with irradiation[5,6]. After 
forming P*PINN* structure, platinum is expanded in short time, in order to reduce the 
minority carrier lifetime in the vicinity of PN junction, then electron irradiation is 
adopted to reduce the minority carrier lifetime of the base region away from the PN 
junction appropriately to ensure that the reverse recovery time was short and soft. The 
platinum concentration and the distribution of minority carrier lifetime are shown in 
Figure 4. 
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Fig. 4. Pt concentration and Minority carrier lifetime distribution 


4 Analysis of Experimental Results 


A design and study of process technology are made to the FRD chip of 75A- 
150A/1200 ~ 1700V, and the trial of the chip. The chip's major testing technical indi- 
cators such as shown in Table 2, provides the international well-known enterprise 
ABB similar product parameters. 


Table 2. FRD Performance compared with similar foreign products 


Technical indicators 
Num | Style Company | Vp Teng Vem tr On Ss 
(Vv CA) (CV) (us) (uc) (t/t) 
1 ZKR75-12 Chunshu | 1200 75 2.3 | 0.42 20 
2 | SSLX12F1200| ABB 1200 75 2.3 | 0.4 13 
1 ZKR150-17 Chunshu | 1200 150 2.0 | 0.6 68 
2 | SSLX12K1711} ABB 1200 150 2.0 | 0.66 73 >0.5 
1 ZKR300-25 Chunshu | 2500 300 3.0 | 3 400 
2 | S5SDF05D2505| ABB 2500 300 2.3 | 3.6 840 
1 ZKRS00-25 Chunshu | 2500 500 3.0 | 3.5 500 
2 | SSDF2501 ABB 2500 500 1.9 | 5.7 500 


From Table 2 it can be seen that the actual test specific data, models were ZKR75- 
12, ZKR150-17, ZKR300-25 and ZKR500-25 repetitive peak reverse voltage Verm 
for the 1200V, 2500V, rated average forward current Ip for the 75A and 150A. Vari- 
ous types of chips Vem, tr, Q,, and soft characteristics of the test results meet the de- 
sign specifications and design requirements. Same chip has been fully achieved the 
level of internationally famous enterprises ABB parameters similar products. The chip 
is currently used on self-produced modules. At the same time, products have been 
gradually applied to the Beijing subway and other devices and have good function. Its 
capability has reached the level of similar products abroad. 
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5 Conclusion 


In this paper, Vg = 94p,°7 and n = W;/ Xm = 0.25 for double mathematical model for 
fast soft recovery FRD use of novel high resistance thin base P*PINN* various struc- 
tural parameters were optimized; and expanded through the use of chiptwo platinum 
and electron irradiation lifetime control technology, jointly controlled base region 
minority carrier lifetime and distribution, which an be designed and manufactured 
with high current Ipy, high pressure Uprm, low-state voltage drop Very, a small lea- 
kage current Irgm, short recovery time t,, and soft characteristics of high-power high- 
performance index P*PINN* diode, the indicators and performance have reached the 
level of similar foreign products. This indicates that the design method and the selec- 
tion of the parameters are correct, reasonable, and realistic, fully can be used in prac- 
tical design. The P*PINN* structure in this paper for the design and manufacture 
of diodes provides an important guide and reference for new methods and new 
technology. 
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Abstract. In this paper, a integrated gate drive circuit of reverse-conducting 
GCT with 1100A/4500V is analyzed and designed. Firstly, the turn-on and turn- 
off circuit is analyzed and designed according to the requirment of device drive, 
and using PSPICE to make simulation, moreover, the influence of inductance 
L, and L,, capacitor Co, and switching Qs parameters to the gate drive current 
were analyzed, as well as the analysis and optimization of these parameters. 
And then, for FPGA, as the core control chip of the control circuit, using the 
software QuartusI1 to writing code and completing the function simulation. Fi- 
nally, get a complete GCT integrated gate drive circuit with the turn-on and 
turn-off circuit and the logic control circuit, and provide a design method for 
the development of driving circuit design of integrated GCT. 


Keywords: GCT, integrated gate drive circuit, FPGA. 


1 Introduction 


The integrated gate commutated thyristor IGCT) is a power switch device which 
directly integrate gate commutated thyristor (GCT) with gate drive circuit with big 
current capacity[1], high switching frequency[2], low conduction drop and good EMI 
characteristics[3]. Now it has begun to be widely used in many high-power fields. 
Gate drive circuit is a important part of IGCT and has direct influence on performance 
of GCT[4]. Therefore, the research and development of its high-performance gate 
drive circuit has broadly practical significance to further expand the application range 
of IGCT. 

Based on the working principle of GCT in the paper, a design to gate drive circuit 
of reverse-conducting GCT with 1100A/4500V is provided, and simulation analysis 
and parameter optimization are made while the core control chip with FPGA as con- 
trol circiut to achieve effective control of GCT.Finally a complete drive circuit of 
integrated gate commutated thyristor is got, which includse the switch circuit and the 
logic control circuit. 


2 Drive Principle of GCT 


Based on the structure and working principle of GCT[5], in order to control the turn- 
on and turn-off of GCT effectively, gate drive current amplitude and the size of the 
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Fig. 1. Gate drive current waveform 


rate of current rise di/dt have strict demand, the gate drive current waveform is shown 
in Fig 1[6][7]. 


Turn-on Process: When gate drive circuit of GCT has received the opening signal, 
gate drive circiut make the maxium of gate drive curent Ig, GCT instantly is turned on 
during the time 0 - T), at the time, gate drive circuit remains within the time T)-T> to 
provides a high gate drive current to GCT until the end of T,, which GCT is fully 
turned on; After GCT’s turn-on, in order to avoid the GCT automatically turning off 
for the anode current of GCT is less than the holding current, gate drive circuit con- 
tinue to provide a gate holding current(Ipgy) of the amplitude of 2A to 6A to GCT in 
the time T,-T3 , and the GCT is to be on-state at the time. 


Turn-off Process: When gate drive circuit of GCT has received the turn-off signal, 
the gate drive unit cut off the gate drive current quickly ,and adds reverse current to 
the gate-cathode ,then GCT is turned off within the time T3-Ty. 


3 Integrated Gate Drive Circiut of GCT 


3.1 Integrated Gate Circuit Diagram 


Integrated gate drive unit of GCT is made of the fiber optic transceivers, the power 
circuit, the logic circuit,the turn-on and turn-off circuit, the state display parts and so 
on, which is shown in Fig 2. 
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Fig. 2. Integrated gate circuit of GCT diagram 
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When the fiber receiver has received the light open command signal, the light sig- 
nal is transformed into electrical signal, and input it to the logic control circuit and 
judge it, if it meets the open condition,the logic control circuit sends out the open se- 
quence pulses to the turn-on and holding circuit, which produces gate drive current to 
achieve turn-on of GCT; After GCT’s turn-on, the turn-on and holding circuit contin- 
ue to provide the gate holding current, which make the GCT on-state.The gate state 
detection circuit detected the turn-on signal of GCT, and transmits it to the logic con- 
trol circuit, then the logic control circuit transmits feedback signal to the optic trans- 
mitter, which is sent to external control system while the state display circuit displays 
the working state of GCT for the external work staff; If it does not meet the openging 
condition, the logic control circuit does not response to turn-on demand, but continue 
to wait for next turn-on command signal, and judge it again until the turn-on com- 
mand signal is effective. The turn-off of GCT is similar to its turn-on, just in the turn- 
off process, the operation signal is turn-off signal, and the logic control circuit 
controls the turn-off circuit. 


3.2. The Turn-On and Turn-Off Circuit 


3.2.1 Working Principle 

The turn-on and turn-off principle diagram is shown in Fig 3. The turn-on and turn-off 
circuit of integrated gate drive circuit of GCT is made of two buck circuit[8], which 
mainly complete the turn-on and turn-off process of GCT, the turn-on process in- 
cludes strong opening large current chopping and holding opening small current 
chopping. 


a, la 
L, 
f 
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Fig. 3. The turn-on and turn-off principle diagram 


Turn-on Process: In strong opening large current chopping process, Q; and Qor are 
on and Q, and Q;are off, L; is charged, Q; and Qorr are turned off when I,; is the max- 
imum (Igy), L; is discharged, and generate strong pulse current into the gate quickly, 
GCT is turned on instantaneously, I,; will discrease, Q; is turned on when I,; dis- 
creases to certain value, L, is charged in constant voltage(Vpc—Vorr), the gate current 
gets up slowly to a certain value ,then Q, is turned off, L, is discharged again, I,, will 
discrease again. Therefore, by togging on and off, Q, will keep the turn-on gate cur- 
rent within specified limits, which GCT is completely turned on; In holding opening 
small current chopping process, Q, and Qo is off, then through sense resistance Ry 
to judge the size of holding current (Iggy), which is compared with the threshold of 
gate current that is seted.Therefore, by controling Q,’s on and off to make GCT be 
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on-state. L,is charged in constant voltage (Vpc-V or) when Q; is on, on the contrary, 
L, is discharged , Q; is always on in the process. 


Turn-off Process: Q;, Q> and Q; is off, Qoff is on, the bias voltage Vo is reversed to 
the gate-cathode of GCT to implement that GCT is turned off by "flow". After GCT’s 
off, Qore will continue to be off to ensure that gate-cathode of GCT is always negative 
bias and reliable closed. 


3.2.2 Analysis and Optimization of Parameters 

According to the gate drive curent requirement of on and off of GCT, basic part pa- 
rameters for basic drive circuit are given through theortical anaysis and calaculation, 
they are: Vpc=20V, L)=0.20H, L2=100H, Cog=16000F/35V, sense resistor R7=1Q, 
Q), Q» and Q; is a respectively MOSFET with Vpss=100V and Rps=0.055Q, D, is a 
diode with Verm=45V and Ipsm=150A ; D, is a diode with Verm=45V and Ipsm=25A, 
D; is a common diode, all above is the basic model parameters for circuit simulation. 
On this basis, related components parameters (Lj, Lo, Qore and Core) for the influence 
of the performance of drive circuit are analyzed and optimized. 


(1) The Influence on Drive Current from L, 

L, mainly determines the maximum aplitude of gate drive current(Igy), under the 
same parameters, L, is given different values, the maximum amplitude of the gate 
drive curent is changed, as shown in Fig 4. 


IFGHMINIA 


Fig. 4. the change of Igy with the change of Fig. 5. the change of Ipguwm with the change 
L 1 of ln 


It can be seen from Fig 4 that Igy is the maximum when L, = 0.1uH, all above 
meet the requrement of gate drive curent amplitude, but in the first gate continuous 
current discharging, the gate current is zero and GCT is closed, thus L; = 0.14H did 
not meet the conditions; When L, < 0.1mH, the Igy decreases with the decrease of Ly, 
and the change of gate current is down to zero in the first gate continuous discharging, 
so all values of L, are not desirable during L,; < 0.1mH.When 0.1nH<L<0.2uH, the 
Igm decreases with the increase of L;, but in the first gate continuous discharging, gate 
current can not reach above 50A, so the value of L, is not desirable in the range. Igy, = 
400A When L, = 0.2 WH, gate current reaches above 50A after the first gate conti- 
nuous discharging, which meets the requirement of gate drive current, the value of L, 
is not taken into consideration in L; > 0.20H, thus L,; = 0.2 uH is for the optimal. 
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(2) The Influence on Drive Current from L, 

L, mainly determine the size of gate holding current, under the same parameters, Lis 
given different values, the maximum (Iggymax) and minimum(Ipgumm) of GCT hold- 
ing current are also changed, which are shown in Fig 5 and Fig 6. 


<2 
: $ 
= 2 80 
5 ™ Ps 
ra] eae ae ° 
ae 
4 - 
10 ~~. : Pre 
aa oa 20 2" 
i ° r 1 1 + t 
5 1500 1600 1700 1800 1900 
7 Lu hc _ Coft/uF 


Fig. 6. the change of Ipgumax With the change Fig. 7. the change of Igpg with the change of 
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According to the requirement of GCT gate holding current(2A<Ipgu<6A) , it can be 
seen from Fig 5 that Ipcuwin will decreases with the increase of Lo, Ipcuwin = 2A 
when L, = 10uH.It can be seen from Fig 6 that Iggumax will decreases with the in- 
crease of Ly, Ipcumax=6A when L,=10uH.Therefore, it can be obtained that it just 
meet the requirment of GCT holding current 2A<Izgy<6A when L, = 10uH, in that 
case, GCT is on-state. So L,=10uH is the best. 


(3) The Influence on Drive Current from C of 
In the gate turn-off process, C,¢¢ mainly impact the size of the gate turn-off recoil cur- 
rent (Igr).Under other components parameters are not changed, Cor is given different 
values , the change of the gate turn-off recoil current (Ig) is shown in Fig 7. 
According to the need of the gate turn-off of GCT, the gate turn-off recoil current 
(Icr) is as small as possible. Fig 7 shows that the gate turn-off recoil current is mini- 
mum when C,=1600uF, and has the least effect to the turn-off of GCT; the gate turn- 
on current is distortion when Cog < 1500uF, so the value of Cog is undesirable within 
this range; When Co, >1900uF, the gate recoil current (Icgg) will increase and even- 
tually tends to remain constant, thus the value of Cor is desirable in the range. Cop is 
electrolytic capacitor because of its high capacity, and can be charged to 20V in the 
turn-on process of GCT, the voltage of Cor across the course of turn-off of GCT is 
20V and was added to the gate-cathode of GCT, which makes GCT is turned off in- 
stantly. Therefore, electrolytic capacitor Cor is 1600UF will be the best. 


(4) The Influence on Drive Current from MOSFET 

The resistance of Qp; directly affects the size of the amplitude of gate drive current 
(Icm) and the size of Icgr during the gate turn-off course of GCT .Therefore, we must 
take into account a question that how many MOSFET is parallel should be used. Un- 
der other components parameters is unchanged, the number of MOSFET is changed, 
the change of the gate drive current amplitude (Igy) and the gate recoil current (Igpr) 
are shown in Fig 8 and Fig 9. 
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It can be seen from the Fig 8 that Igy slowly changes when the number of MOS- 
FET is more than two nearby the 400A, the gate drive current waveform is distorted 
when the number of MOSFET is between two and four, thus the number of MOSFET 
is undesirable in this range. When the number of MOSFET is five, Igm reaches the 
maximum value. At the same time, it can be seen from Fig 9 that the gate turn-off 
recoil current(Igr) is minimum when the number of MOSFET is five, thus the optimal 
for Qore is the five parallel MOSFET. 

Finally, the turn-on and turn-off of GCT are simulated and analysed and the timing 
pulse are applied on the Q;, Qo, Q3 and Qor. Fig 10 shows the timing waveform of the 
switch when GCT is a "turn on-holding- turn off" process. The GCT's gate-cathode 
structure is equivalent to a diode, therefore, using a diode instead of GCT in simula- 
tion, simulation waveforms were shown in Fig 11. 


Fig. 10. Switch timing waveform Fig. 11. GCT gate current waveform 


It can be seen from Fig 10 that Q;, Qo, Q3 and Qos are four switches of the turn-on 
and turn-off circuit. High potential represents the closed switch, while low potential 
represents the turned off switch. GCT is turned on within the 0-6611s; GCT is the state 
of holding in the 66-155us; GCT is turned off until 155s and GCT is turned off with- 
in Ips. 

It can be seen from Fig 11 that the maximum gate drive current amplitude Igy = 
400A, the gate current rise rate di/dt > 1000A/us, second trigger current Ig > 50A, the 
gate holding current 2A < Iggy < 6A and GCT is turned off within lus. These fully 
meet the gate current needs of turn-on and turn-off of GCT, and also prove that the 
selection of each component parameters is to be the best. 
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3.3 Control Circuit 


3.3.1 Block Diagram of Control Circuit 

The logic control circuits of integrate gate drive circuits of GCT use FPGA as the 
control core, the control command signal (CS) for the user input is judged by the 
software programming, and translate them into a series of control timing pulse to con- 
trol the turn-on and turn-off of Qi, Qo, Q3 and Qore in GCT gate drive circuit, and then 
realize the turn-on and turn-off of GCT and display its state, the software block dia- 
gram is shown in Fig 12. 
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Fig. 12. Block diagram of control circuit 


The main roles of timing control module: to determine whether there is the turn-on 
or the turn-off signals, the minimum gate turn-on and turn-off time whether is met, 
the detection of minimum gate turn-off interval after the re-triggered. If the signal CS 
meets the turn-on and turn-off conditions, the turn-on and turn-off signal is generated 
to control the signal timing generation module and generate the gate switching se- 
quence pulse, finally, GCT is turned on and off. Otherwise, it will filter the received 
signal CS. The gate re-trigger module will judge the gate status and determine wheth- 
er the gate status meets the re-trigger conditions, if meet, it will results to a re-trigger. 
The on and off signal timing generator module mainly generate the gate switching 
sequence pulse. The detection and protection module are used to detect the working 
state of GCT and generate the state feedback signal. 


3.3.2 Simulation Results and Analysis 

According to the requirement of gate drive current of GCT turn-on and turn-off, and 
through the logic control circuit, the turn-on and turn-off sequence of the switch are 
controlled. Timing simulation results are shown in Fig 13. 

Fig 13 shows a normal "turn on - holding - turn off" process of GCT, the switch 
timings of Qy (q)31), Qo (qr21), Q3 (ry) and Qorr (Gyo) are output from the FPGA and 
reach 20V after amplitude amplification, it can completely control the MOSFET turn- 
on and turn-off, then produce gate drive current to turn on and turn off GCT effective- 
ly. This time, the magnified switch timing are fully consistents with the switch timing 
in Fig 6. In the process, sf is feedback signal, sf is opposite with cs when GCT is 
normally turned on and turned off, otherwise, they are same, then it is sent to the con- 
trol system through external optical transmitter to facilitate the field staff to detect 
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Fig. 13. Switch timing waveform 


it. Signal lgton, lgtoff and lfault are respectively the trun-on, turn-off and fault of gate, 
and through external circuit LED display it to facilitate real time detection of GCT’s 
working status. 


4 Conclusion 


The paper is based on the basic structure and working principle of GCT, which 
presents a design of gate drive circuit of reverse-conducting GCT with 1100A/4500V 
, mainly make the simulation and parameters optimization of the turn-on and turn-off 
circuit of GCT and the design of the logic control circuit, finally get Vpc=20V, 
L,=0.20H, L,=10uH, Cog=16000F/35V, current sense resistor Rp=1Q, Q, Qo and Q3 
is a respectively MOSFET with Vpss=100V and Rps=0.055Q, Qor is five parallel 
MOSFET, D, is a diode with Vprm=45V and Igsy=150A, D2 is a diode with 
Verrm=45V and Ipgm=25A, D3 is a common diode, and they are the optimum design 
parameters, under the effective control of the logic control circuit, it achieves the turn- 
on and turn-off of GCT efectively. 


References 


1. Tong, Y., Zhang, C.: The influence of Anti-parallel diode to turn-off process of IGCT. The 
Journal Of Electrical Technology 22(11) (2007) 

2. Steimer, P., Apeldoorn, O., Carroll, E.: Nagel A: IGCT Technology Baseline and Furture 
Opportunities. IEEE/PES 2(2) (2001) 

3. Bernet, S., Teichmann, R.: Comparion of High -Power IGBT’s Hard-Driver GTO’s for High 
Power Inverters. IEEE, Transactions on Industry Application, 711-718 (1998) 

4. Zhang, C.: Rearch On Gate Drive Circuit Of Integrated Gate Commutated Thysistor. Peking 
Communication University (2007) 

5. Klaka, S., Linder, S., Frecker, M.: A Family of Reverse Conducting Gate Commutated Thy- 
sitors for Mediu Voltage Drive Applications, PCIM Hong Kong, 1-11 (October 1997) 


Analysis and Design on Drive Circuit of Integrated Gate Commutated Thyristor 61 


6. Huang, X., Zhang, X., Xie, L.: Design Of Drive Unit Control Circuit GCT. The High Power 
Converter Technology (1), 9-13 (2009) 

7. Zhuang, L., Jin, X., Wu, X., Xie, L., Huang, X.: Rearch on a Novel Gate Circuit of Inte- 
grated Gate Commutated Thysitor. Communication Technology And Electric Traction (2), 
16-20 (2008) 

8. Qiu, C., Qiu, R.-c.: Research on a Novel Gate Drive Circuit of Integrated Gate Commutated 
Thyristor 


Multi-fuel Combustion Measurement in Utility Boiler for 
Optimal Operation 
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Abstract. This paper presents a combustion measuring method for optimal 
combustion through adjusting the proportions of coal, blast furnace gas (BFG) 
and coke oven gas (COG) in a 200 MWe utility boiler. A reconstructed temper- 
ature and a flame emissivity algorithm are derived to analyze the mixed com- 
bustion in the furnace. The result shows that higher combustion efficiency can 
be obtained by setting the proper ratio of coal, BFG and COG, namely 
0.7:0.2:0.1. 


Keywords: multi-fuel combustion; image processing; temperature; flame emis- 
sivity; optimal operation. 


1 Introduction 


Combustion products in pulverized-coal fired boilers include gaseous and solid mate- 
rials, such as CO2, H20, char, fly ash, and soot. Many researchers have explored the 
variation of particle emissivity in pulverized-coal with the volatiles removed [1]. 
Temperature and flame emissivity measurements have also been carried out as a use- 
ful topic [2-7]. Combustion with several kinds of fuel has often been used for the sake 
of energy saving and emission reduction in a utility boiler [8] in recent years. A 200 
MWe power plant boiler was retrofitted for mixed combustion of coal, blast furnace 
gas (BFG) and coke oven gas (COG). 

In this paper, experimental research is performed through adjusting the proportions 
of coal, BFG and COG. Based on the temperature and a flame emissivity monitoring 
algorithm, infrared radiation thermometers and a portable image processing technique 
are applied to analyze the multi-fuel combustion conditions. 


2 Measuring Algorithm 


The experimental boiler was designed for burning lean coal. Fig. | shows the structural 
schematic and the location of burners. The 3-D temperature field can be reconstructed 
from cold ash hopper to flame corner [9]. Fig. 1(a) shows the location of the measuring 
points in the boiler. The correctness of the measured results was revised by reference to 
a black-body furnace. 
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According to Wien's law, the monochromatic radiation intensity can be expressed 
as: 


1(A,T)=e,ce 2" Kd’) (1) 


Where T is Kelvin temperature, / is wavelength, c; and cz are constants, and €is the 
flame emissivity. 
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(a) Boiler structure and location of measuring points (b) Arrangement of burners 


Fig. 1. Schematic views of boiler structure in experimental study 


The flame image can be transformed to a digital signal based on image processing 
technology, which is formed with three primary colors of Red, Green and Blue. Each 
pixel represents a monochromatic radiant intensity, and it can be illustrated as [10]: 


1(A,,T)=k,R 


(2) 
I(A,,T) =k,G 
Where, k, and k, are calibration coefficients, 1(A,T) and I(A,,T) are the monochromat- 
ic radiation intensities at the wavelengths 2, and A,, respectively. 
From consideration of equations (1) and (2), the calibration coefficients k, and k, 
are: 


k=cee Oe '" GR) 


: 3) 
k,=ce,e 7” (aRA3) 
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The pulverized-coal combustion flame can be treated as a gray body in visible light, 
and two arbitrary colors can be extracted from the homogeneous radiation image. 
Adopting Red and Green for calculation, the flame temperature can be expressed as 
follows: 


(4) 


Therefore, the flame emissivity can be obtained through the expression given below: 


k 2RA? Ice") 


é= e (5) 
k,aGA3 (ce ?'*") 


Where, c; = 3.741832x10° W.uum’/m’, c) = 1.4388x10" um.K, A, = 0.61m and A, = 
0.51 um. 


3 Results and Discussion 


Fig. 2 shows the temperature distribution at several cross sections with different fuel 
proportions. The highest temperature exists in the region of the burners, and tempera- 
ture decreases along the height of upper boiler, which is consistent with the practical 
mode in these four cases. 

The temperature of the right wall is much higher than that of the left, and the tan- 
gential circle of flame center is displaced towards the back-right wall at the first 
floor(height 16.0 m in boiler). However, the tangential circle at the third floor (height 
28.0 m) is displaced towards the front-right wall. 

Figs. 2 (a) and (b) show that addition of COG leads to an increase in the highest 
temperature of 52 K; but the location of the flame center is not influenced. Fig. 2 (c) 
shows that addition of BFG results in a decrease in temperature of 94 K. 

As shown in Fig. 2 (d), the furnace temperature was raised along with a decrease in 
the height of the flame center by further addition of COG. The percentage distribution 
of multi-fuel combustion, pulverized-coal, BFG and COG is nearly 70%, 20% and 
10% at a load of 170 MWe. 

The difference between the temperatures measured by infrared radiation and by 3- 
D distribution is shown in Fig. 3 (a) and Table 1. The error is less than 5%, and the 
two methods correctly reflect the tendency of temperature in furnace. 

The flame emissivity is considered as the direct reflection of combustion and 
radiation in the boiler. The developed method is to monitor and judge combustion 
behavior for varied kinds of fuels by means of flame emissivity. 
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Fig. 2. Distribution of 3D temperature based on flame image processing with mixed fuel com- 


bustion at a load of 170 MWe 
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Fig. 3. Flame image and temperature measurement 


(b) Flame image with mixed fuel 
combustion of coal and COG at 170 MWe 
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Fig. 4 shows the flame images at different height with multi-fuel combustion in the 
furnace. It can be seen that the variation of fuel types has a significant influence on 
the flame image, and addition of less BFG and addition of more COG can make the 
flame brighter. 


(a) Flame images of 2g0j (two ports of BFG + zero port of COG) 


(b) Flame images of 2g2j (two ports of BFG + two ports of COG) 


Z 


(c) Flame images of 2g5j (two ports of BFG + five ports of COG) 


Fig. 4. Flame images at different heights with multi-fuel combustion 


Table 1. Temperature comparison between 3D visualization system and infrared measurement 
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Table 2 shows the flame emissivity with multi-fuel combustion at different heights. 
The largest flame emissivity existed in the upper region of burners, where the highest 
temperature accelerated the separation of volatiles and decreased the accumulation of 
un-burnt particles in the region. Furthermore, addition of COG should introduce a 
large amount of volatiles to form soot, and produce higher flame emissivity. 

However, most fuels had burnt out at 32.0 m, and the flame emissivity changed 
very little. In general, BOG can reduce the concentration of soot and restrain 
combustion, and COG may accelerate combustion through increasing the 
concentration of volatiles. Proper mixture ratio of multi-fuel is one of the important 
factors for optimal operation in a utility boiler. 
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Table 2. The variation of emissivity in furnace with mixed fuel combustion 


Height Emissivity on coal combustion with 
in furnace | mixed-fuel of BFG+COG in furnace 


(m) 


*: ‘g’-BFG, ‘j’-COG, coefficients represent the number of operating ports. 


4 Conclusions 


A suitable method is put forward to analyze the combustion behavior based on flame 
image digital processing technology in multi-fuel combustion in a power plant boiler. 
The addition of BOG may lead to a reduction of flame emissivity and decrease the 
temperature; and increment of COG should alter the procedure. The proportion of 
mixed fuels; coal, BFG and COG; for optimal combustion is nearly 0.7:0.2:0.1 at a 
load of 170 MWe in the boiler. 
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Abstract. Future spacecraft are envisioned as autonomous, miniature, and intel- 
ligent space systems. This paper describes the design and implementation of a 
model predictive control (MPC) system for satellite attitude control. The MPC 
algorithm is designed to successfully deal with constraints due to the small con- 
trol torque given by magnetic torquers and the Earth’s magnetic field. Laguerre 
functions are proposed to simplify the implementation of the MPC controller 
for on-line computation. A control system processor is designed as a peripheral 
hard core of the system-on-chip for satellite on-board data handling. Targeting 
the FPGA technology, this processor runs up to 120 MHz. 


Keywords: Control System Processor, FPGA, Model Predictive Control, 
Satellite. 


1 Introduction 


The development of what is known as nanosatellites, with typical mass less than 
10kg, is quickly transforming the Space development scene as it allows engineering 
and science researchers to send into orbit various payloads within a short time at low 
cost. In particular, in the last 10 years many universities like Stanford [1] have carried 
out university satellite programs, and nanosatellites have been built and successfully 
launched into space. Such small satellite can be controlled by various actuation me- 
thods, including thrusters, reaction wheels, magnetic torquers, or a combination of 
above. Currently electromagnetic actuator is the most effective approach, and has 
been adopted for many nanosatellite missions, e.g. the CanX-1 [2], AAUSat [3], 
Compass One [4] and so on. 

The magnetic torquer interacts with the earth’s own magnetic field in order to gen- 
erate a control torque acting on spacecraft. An advantage of magnetic torquers is that 
they require no fuel. They do require electrical power, but there is no exhaust pollu- 
tant and by providing a couple they are not sensitive to movement of the centre of 
mass. One drawback of this control technique is that the torques which can be applied 
to the spacecraft for attitude control purposes are constrained to lie in the plane ortho- 
gonal to earth magnetic field, and hence the satellite is under-actuated. In the equa- 
torial plane, the magnetic field line always lie horizontally, north-south. Consequently 
a spacecraft whose orbit lies in this plane cannot use magnetic torquers to counteract 
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the north-south component of their disturbance torque, or to dump this component of 
momentum. For an inclined orbit, suitable variation of the magnetic field allows con- 
trollability in the long term, but presents a significant challenge from a control pers- 
pective [5]. The model predictive control is proposed to solve such problems for 
space missions [6-8]. 

The MPC algorithm results in very complex matrix operations, which limit its fea- 
sibility for small satellites. In real-time control, to execute extremely fast control laws 
for feedback systems, a control system processor (CSP) was designed [9, 10]. The 
excellent performance of the CSP is achieved by implementing simple mixed data 
formation, with 24-bit fixed point data for state variables and 11-bit low precision 
floating data for coefficients. The CSP takes advantage of single processing element, 
i.e. multiply and accumulation (MAC) to execute all the arithmetic operation. In [11], 
another dedicated control system processor was developed based on 1|-bit processing 
[12]. In this paper to effectively implement the MPC, the controller structure is sim- 
plified using Laguerre functions [13]. At the same time, a system-on-chip (SoC) is 
proposed for the satellite on-board data handling system (OBDH). A dedicated model 
predictive control system processor (MPCSP) is designed as a peripheral hard core of 
the SoC for attitude control. 

The remaining paper is organized as follows. Section 3 proposes a novel MPC ap- 
proach using Discrete-time Laguerre networks in magnetic attitude control problem. 
Section 4 introduces the MPCSP design. Section 5 presents the simulation results for 
attitude control of a student-built nanosatellite. Section 6 concludes. 


2 Model Predictive Control 


2.1 Discrete-Time MPC Using Laguerre Functions 
Considering a linear system described by the discrete-time state space model 
Xm(k + 1) = AyXm(k) + Buck) 
y(k + 1) = Cn Xm(k) (1) 


According to Ref. [14], the er model is built as follow. 


ga EE > ae aoe 
eco | leek le, TG B0" ae +{P8] 3, auc 
y= fo = 0 afm? Q) 


A set of Laguerre functions can be used to formulate MPC problem. At an arbitrary 
future sample instant, the difference of the control signal can be expressed as 
Li(k)y O - 0 
Auth =| 9 Le@r 0 
0 0 ie Lin (Kk)? 
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L,(k) represents the Laguerre network description for the ith control. L;(k) is given 
by L;(k + 1) = AjL;(k), where matrix A, is (NxN) and is a function of parameters 
a and B = (1—4?), and the initial condition is given by L(0)" = /B[1 - a0? — 
a3 ...(—1)N-4aN~7], and 


a 0 0 0 0 

B a 0 00 

A, =| —28 B a 0 0 
a7B -aB B a O 


—a°B a®B -aB B a 


The scaling factors a and N can be selected independently for each control signal. n is 
the parameter vector and yn, comprises N Laguerre coefficients. n = [ning ie i n, | 
and n, = [cy Cz C3... Cn] 


The cost function is defined as 
J == "xn + 2n'Tx(kj) (4) 


where, X = YP, &(m)Q&(m)" + R,, T= E(m)QA™, &(m)" = Lp A" BL, 
and N, is the prediction horizon. 

It should be noted here that k; represents the current sampling time and k 
represents future sampling time. 


2.2 Problem Formulation 


In this paper, the idea is to design a controller, which guarantees the orthogonality 
between the geomagnetic field vector b and the magnetic control vector T, . and make 
the amplitudes of actuator within the saturations. In this design, the control variable is 
T, = u(k,) = u(k; — 1) + Au(k;) and the constraints become 


b(kj)"u(ki) = 0 (5) 
ae Mmax 2 
j\< 
lacocey 1240 < [mee] PoC) ©) 
where m is the magnetic dipole. This equation can be expressed in term of U as 
[b(kj)" 0 --- OJU=0 (7) 
B(b(k;)) O- 0 Mmax 2 
< 
-B(b(k,)) 0 - ofUS [me | IbC (8) 
where, U = Gn + [u(k; — 1) u(k, — 1)7 -- u(k, — 1)7]", 
Deo Li)" 0 eas 0 
e(k, +k) = 0 ay L2(i)" 2 0 ; 
0 0 i De be) 


G=[g(1)" (2) + g(Np): I". 
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When the constraints are considered, the optimization procedure is to minimize the 
cost function J in Eq. 4 and operate quadratic programming, which is able to handle 
both the equality and inequality constraints to get the optimal solution n__.. 

opt 


The difference of optimal control signal Au(k;) is given by 


L,(0)" 0 oes 0 
Au(kj) = = ea : ° Nopt ) 
0 0 + Lm (0)7 


And the optimal control signal u(k;) can be calculated as u(k;) = Au(k,) + 
u(k; — 1). Finally, the optimal vector of the coils’ magnetic dipoles Mopt can be 
calculated by 
i 

Mopr (ki) = EGaye Bebtki)) ulki) (10) 
By using Laguerre functions, if the prediction horizon the Number of terms N is set to 
be 5 for each control signal, only five parameters are needed to capture the current 
control signal and totally 15 parameters for three control signals. While in traditional 
MPC, thirty parameters are needed to capture one control signal, and totally 90 para- 
meters are needed for three signals. By reducing the number of parameters in the 
controller design, Laguerre MPC can dramatically release the computational burden 
for nanosatellites. 


3 Hardware Design 


Most MPC applications target process control, in which sample periods are low and 
the plant is physically large, meaning that processing based upon an industrial com- 
puter is adoptable. However, the proposed electro-magnetic control system is target- 
ing very small satellites, in which the controller hardware must be embedded. It may 
require very high sampling rate for precise attitude control. Also executing the MPC 
is relatively complex with heavy matrix operations. It is therefore requires some high 
performance device for control system processing. 


3.1 SoC for Satellite On-Board Data Handling 


There are two types of satellite on-board data handling systems: central and distri- 
buted. The central processing approach has one on-board computer to deal with all the 
data processing for each subsystem. The distributed processing approach, however, 
has many on-board computers. Some subsystems may have more than one processor. 
Nowadays most of the satellites adopt the distributed approach, but for nanosatellites 
this approach is not efficient due to the limited size and power. Hence, the SoC solu- 
tion is proposed not only for attitude control but also for data processing for other 
subsystems. 

The SoC can implement the whole digital functionality of the satellite on a single 
chip. The gate densities achieved in current FPGA devices have enough logic 
gates/elements to implement different functionalities on the same chip by mixing 
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self-designed modules with third party ones. In the SoC, it contains dedicated proces- 
sors for each subsystem. Fig. 1 shows the SoC architecture. It contains a general pur- 
pose processor and the dedicated processors. Its structure comprises an AMBA [15] 
compliant bus that communicates between the general purpose processor and the 
control system processor. Generally, the MPCSP is independent, but the general pur- 
pose processor as the controller of the SoC monitors the control state variables and 
satellite attitude information from the inertial sensors. These data are a part of teleme- 
try information for satellite house-keeping. 


General Purpose Model Predictive Control 
Processor System Processor 


Other Subsystem 
Processors 


Fig. 1. Soc architecture for satellite OBDH 


3.2. MPCSP Design and Implementation 


To map the control algorithm to processor architecture, it is divided into tasks or 
processes. These processes include data input and output (IO), data storage (Memo- 
ries), timer, instruction fetching and decoding, next instruction address calculation 
(Program Counter) and arithmetic operations (ALU). This partitioning should allow 
all the processes to be mapped easily into hardware, minimising the resources 
required. 

The number of concurrent operations can determine the amount and functionality 
of the hardware structures. For example, the maximum number of simultaneous data 
transactions that required for arithmetic operations determines the number of ALU 
ports. Also, communication channels between the ALU, accumulator, memories and 
IO must be assigned with specific data bus. 

The execution of the control algorithm requires the repeated execution of a set of 
instructions (program). Although the number of instructions in the control loop can be 
small in the case of implementing a simple controller, the overhead that manipulate 
the program counter maybe relatively large. We therefore must pay special attention 
to the architecture of the program counter that implements control loops. Thus, the 
MPCSP can provide a looping mechanism that introduces a short, or ideally zero, 
overhead. 

The final step is to create a hardware model that supports the operations needed to 
implement the control algorithm. This hardware model is programmed using the 
hardware description language. The resulted MPCSP is simulated and verified by 
running the MPC algorithm with the application-specific instructions. The MPCSP is 
then synthesized, floor planned and placed & routed. The final netlist can be verified 
via being downloaded into the FPGA and running the hardware-in-loop simulation. 

Fig.2 shows the MPCSP architecture. It adopts a simple mixed data format in 2’s 
complement: the coefficients are in 12-bit floating point format, with 6-bit for mantis- 
sa and 6-bit for exponent. The state variables and other data are in 32-bit fixed point 
format, with 16 integer bits and 16 fractional bits. The memories include 
program ROM, data ROM and data RAM for control program, coefficients and 
intermediate states respectively. The sample timer is used to determine the sample 
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interval for each control loop. The program counter processes one instruction at each 
clock cycle. It will halt the operation at the end of the control program, and reset to 
the address where the control loop starts at the rising edge of the sample timer. 

The MAC is the only processing element in the previous CSP design. Although this 
approach reduces the circuitry complexity, it is not efficient in terms of power and 
speed. Hence, in the MPCSP design, several processing elements are adopted for 
arithmetic operations as shown in Table 1. Each instruction uses the standard RISC 
convention of 32-bit fixed-length. All instructions have a single clock cycle execution. 


Program dr. 
timer Counter 16-bit 
: Program 
1-bit 
ROM 
en VO (PWM) 
‘32-bit 
4-bitfaddr 
Instruction 
Et Ins 
ALU REE Decoder [aay 
MP CSP addr, Data ROM 
Data RAM 


Fig. 2. MPCSP architecture. 


Table 1. MPCSP instructions 


Opcode Name Function Description 
0000 HLT No Operation 
0001 RDW Read data from data ROM 
0010 WRW Write data to data RAM 
0011 OUT Output the control signals 
0100 MUL Multiply 
0101 ADD add 
0110 SUB subtract 
O111 INV invert 
1000 SET Set the sampling frequency 
[xxl WPC Set the star value for the program counter 


The IO block has 12 inputs and 4 PWM outputs. The inputs are connected to the in- 
ertial sensors, including accelerometers, gyroscopes, magnetormeters and sun sensors. 
The outputs are connected to the magnet coils through the power amplifiers. For attitude 
control, 3 magnet coils are needed to provide three-axis actuation. The data bus and 
address bus of the MPCSP are connected to the AMBA to allow the general-purpose 
processor collect house-keeping data. The MPCSP is implemented targeting the Xilinx 
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Virtex-4 FX100 FPGA technology. The MPCSP occupies less than 8% of the FPGA 
total area. It runs up to 120MHz. The power consumption is around 183 mW. 


4 Implementation and Results 


The simulating nanosatellite operates at low Earth orbit altitude of 650km with an 
orbital inclination of 96° and has inertia matrix J=diag([6.858e-4 6.858e-4 8.164e-4]). 

The MPC tuning parameters are listed in Table 2. We can see that in traditional 
MPC design, the control horizon N, is normally chosen to be more than 30 to get a 
sound control performance. While using Laguerre function, N can be selected as 5 to 
obtain the same performance, which means 6 times less parameters involved in on- 
line computation. 


Table 2. MPC tuning parameters 


MPC Without Laguerre With Laguerre 
Sampling interval 60s 60s 

Prediction horizon (Np) 30 60 

Control horizon (N,.) 30 5 

Scaling factor for Tex Tey Tez 0.5,0.5,0.5 0.5,0.5,0.5 
Number of terms for Tex Tey Tez 5,5,5 5,5,5 

Control weighting (R) 0.1,0.1,0.06 0.1,0.1,0.06 


The control system is initialized at satellite pointing angles of 1° about each axis 
and angular rates of 0.0005 rad/s about the roll, yaw and pitch axes. 

The first set of simulation was carried out in Matlab. The MPC is then imple- 
mented on the MPCSP. A hardware-in-loop simulation platform as shown in Fig. 3 is 
developed to test the control system processor. The MPCSP is implemented in a Vir- 
tex-4 FX100 FPGA board. The satellite attitude dynamics is modelled in C program 
in the computer. The PWM actuation signals from the FPGA board are sent to the 
computer using the industrial digital IO card. The feedback signals like angular rate 
are sent to the FPGA board using the analogue to digital converters. 


Digital 10 PWM 
board 


A/D board 


Satellite Attitude 
Dynamics (in C) 


computer 


Fig. 3. Hardware-in-loop simulation for MPCSP 
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Fig.4 shows the simulation results from both the Matlab and the hardware-in-loop 


simulations. Compared to the Matlab simulation, in the difference is rather small. 
Hence the MPCSP is feasible for satellite attitude control using magnet torquers. 


Roll angle (deg) 


Pitch angle (deg) 


Yaw angle (deg) 


ie} 5 10 15 20 25 
Time (mintues) 


Fig. 4. Satellite attitude angles: comparison between Matlab simulation results and hardware- 
in-loop simulation results (red). 


5 


Conclusion 


A dedicated control system processor is developed to execute the MPC algorithm. 
The MPCSP can run upto 120MHz, while consuming 183mW and occupies less than 
8% area of a Virtex-4 FX100 FPGA. The MPCSP is used to implement a model pre- 
dictive attitude control law for a nanosatellite. The hardware-in-loop simulation 
shows that the MPCSP produces almost the same results as the Matlab simulation. 
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Abstract. The Pay-TV markets are undergoing rapid change. As companies in- 
creasingly look for resources efficiencies, cloud computing is seen making 
waves in the Pay-TV markets. This hugely popular technology will revolutio- 
nized how multimedia content is being delivered. Why? We are in dire needs to 
integrate web, social, mobile and on-demand TV together as interactive content, 
and deliver them to any television set as a single service. We have seen some 
companies offering these services and have been rather successful. What they 
still lack is to provide some form of flexibility in resource(s) provision and con- 
tent management, especially to subscriber(s). Currently, the resources are allo- 
cated based on availabilities and service agreements, while content is managed 
using on-the-top program guide tightly controlled by the service operator. This 
paper introduces a new framework, named iCloudMedia. The iCloudMedia de- 
scribes a cloud-based framework that allows “push” on-the-fly changes and 
“pulls” rich personalized multimedia content simultaneously in an interactive 
resource(s) market. We strongly believe that innovative content branding and 
configurable resource management through easy-to-manage and flexible user 
interface is the next logical step for service operators. 


Keywords: cloud computing, multimedia content, self-configurable resource 
management, system architecture. 


1 Introduction 


Digital content industry is the new driving force for the development of information 
industry and this has led to the growing importance of research in this area. Some 
trends shows that the current content distribution networks (CDN) are in a dire need 
for some sort of improvements and new frameworks need to be designed in order to 
sustain the viability of interactive multimedia services in a broadband context, in 
terms of scalability and QoS. 
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Contract No. N10007 (Research of Video-on Demand Technology for Cable Broadcasting 
Operator with Cloud Computing) and the Seoul R&BD Program (No. PA090720). 
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Cloud computing, the latest buzzwords in the technology sector, describes a new 
technology where services, infrastructures and platforms can be delivered over the 
high-speed network anywhere, anytime and on any device [1-3]. As companies in- 
creasingly look for resources efficiencies, this hugely popular technology has revolu- 
tionized how multimedia content is being delivered and consumed. Many cable and 
IPTV operators leap aboard the cloud computing bandwagon from a green computing 
angle. Why? IPTV operators find it hard to compress HD content below 6 Mb/s, and 
3DTV and HDTV cannot run on 12 Mb/s over a standard DSL line. Cable operators, 
though able to run true HD in a single stream or 3DTV in multiple streams, have to 
fight against rival IPTV operators to deliver high quality multimedia content as tech- 
nologies matured. 

Historically, these operators has been drive testing vast areas, spending huge effort 
and money trying to determine which technology would fit what services. However, 
recently, operators began to focus on how to make more revenues through adding new 
services, introducing personalization and applying green tech initiatives. 


2 Overview 


This paper discusses how interactive multimedia content can be delivered in a more 
efficient and cost effective ways using associated cloud computing technologies such 
as those presented in [4-7]. We present a new interactive middleware for providing 
interactive multimedia services on the cloud, to enable subscribers receive and share 
web TV, social TV, mobile TV and VoD streams from wherever, whenever and how- 
ever. The iCloudMedia framework is a new framework that delivers and sends “push” 
on-the-fly changes and “pulls” rich metadata content by both, the operators and the 
subscribers, through resources availability using an interactive menu in an open mar- 
ket. We strongly believe that innovative content branding and configurable resource 
management is the next logical step for operators to increase their revenues and for 
subscriber to obtain optimal viewing experience. 


3 iCloudMedia 


Fig. | shows the proposed system architecture of a cloud-based interactive multime- 
dia service. Multimedia content, streamed from any of the virtual media servers, are 
being delivered according to subscribers’ requests based-on resources availability, 
through iCloudMedia. 

iCloudMedia business model is shown in Fig. 2. A subscriber(s) is able to change 
his/her reservation and selects any available resource(s) offered in the market through 
a reservation system which manages incoming service request. 

The services available are listed as follows: 


° view available resources 
. add /remove resources 
e move resources to other locations 
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GOVERNMENT COMMERCIAL — CONGLOMERATE INDIVIDUAL 


Fig. 1. iCloudMedia provides a unified control platform to all multimedia services. 


Available 
Add/Remove 


iCloudMedia 


Fig. 2. iCloudMedia business model. 


Fig.3 describes the how all incoming service requests can be granted using a dy- 
namic management system which allocates available resources efficiently and eco- 
nomically in a coordinated fashion. It ensures that interactive multimedia services are 
being delivered according to request and continuous high utilization of resources can 
be achieved, at the same time. 
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Fig. 3. i(CloudMedia provides a unified control platform for all multimedia content. 


Fig. 4 shows an example of the user interfaces for adding storage, among other re- 
sources such as data rate or service plan in an open market using iCloudMedia. iC- 
loudMedia is flexible and is able to provide the 1) operator the ability to market re- 
sources and to introduce new revenue streams through advertisements and promotions 
2) subscriber the ability to purchase resources and personalized their con- 
tent/application(s). 


Welcome Phooi Yee 


Shop PlayList Guide 


User ID: 
States: 


PHOOIYEE 


CURRENT Detail(s): 


HANYANG UNIVERSI 


Media Communications 


STORAGE: © 500Mbyte USD 10 
O iGbyte USD 20 


Fig. 4. Sample user interface using iCloudMedia for adding resource(s) such as storage on a TV 
screen — among other resources in the marketplace 
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We list and categorize the iCloudMedia system components as follows: 


e Shop — consists of resource(s) for sale based on availability offered by ser- 
vice operator 
¢ Account Info — consists of account and personal information, managed by the 
subscriber 
¢ Home — consists a list of available services, such as 
1) “push” content such as program guide including seasonal promotions, ad- 
vertisements and sponsorships, 
2) “pull” content such as Internet, applications, games, telecommunication 
services, managed by service operator 
¢ Playlist — a list of personal content/application(s) stored in the account, ma- 
naged by the subscriber 
¢ Guide — consists of help menu and utilities, managed by the service operator 


4 Conclusions 


The rapidly evolving digital media delivery has open up new standards and services 
with the goal of improving digital media experiences for subscribers. Currently, clas- 
sical media companies are losing their role as the sole providers of home entertain- 
ment with clear indication that the current home video framework will not be able to 
meet the challenges of the scalability, high quality, and real-time distribution of legal 
video content. 

The iCloudMedia presented here is more than a mere program list and can be used 
to reconcile media companies as providers of home entertainment. It built on the 
concept of store-share-download-interact for multimedia contents as well as retail- 
need-service-satisfy for resource(s), all in a same user interface, straight to the televi- 
sion real-time. From the subscriber’s point of view, this framework provide an easy, 
flexible and ubiquitous platform for subscriber to “share” and operator to “retail”, as 
they can now share various multimedia contents — which would become a crucial 
marketing effort for the survival of Pay-TV services. 

The proposed framework is going to revolutionize the digital content industry 
through green tech initiatives for cloud subscribers. For future work, the framework 
can be extended to include optimization solutions for interactive resource(s) market to 
increase the efficiency of virtual media server and virtual storage. 
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Abstract. Shortest path search is one of key problems for big-scale city public 
transportation network (CPTN) query system. Based on the current main search 
algorithms and data models on multi-path search, an improving multi-path 
search algorithm combined A* and deviation path was proposed in this pa- 
per. With our algorithm, not only the optimal path could be provided, but al- 
so Kth approaching optimal paths could be displayed. Due to the analysis on al- 
gorithm complexity and evaluation experiment, the algorithm efficiency for 
multi-path search is much better than typical Djikstra algorithm and collection 
algorithm. Thus, it is more suitable on shortest path search for big-scale real- 
time CPTN. 


Index Terms: shortest path, A*¥ algorithm, Deviation path algorithm, CPTN. 


1 Introduction 


The current common shortest path search algorithms applied to search the shortest 
path in a CPTN are normally concerned with these algorithms: improved algorithms 
based on the traditional Dijkstra algorithm, collection algorithms with a least transfer 
times through set intersection operations, and AI search algorithms based on genetic 
algorithms [1-3]. 

Dijkstra algorithm, which was proposed for single-source problem at 1959, is the 
most popular algorithm to search the shortest path between two-nodes in a topology 
network. Its result not only includes the shortest path from the beginning to the desti- 
nation node, but also includes all the shortest paths to other nodes in the network [1]. 
Though Dijkstra algorithm has many advantages, such as stability, adaptability of 
variation on network topology and low requirement on system memory, it is not suit- 
able to obtain the feedback result of all the shortest paths among all node pair of n 
nodes in a certain time, especially, with the number of node increasing, its computing 
complexity will be o(n’). Dijkstra algorithm can be generally applied to search short- 
est paths in a certain time on a public transportation network in a city with middle and 
small scope. However, some public transportation system is becoming more and more 
big such as Beijing, Shanghai and etc. At present, some algorithms such as the collec- 
tion algorithm [2] and the ant algorithms [3] have been studied to search the shortest 
path in time with reducing the computational complexity. The collection algorithm is 
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studied to search the shortest path with the least transfer time by set intersection oper- 
ation among public traffic lines in CPTN, of course, there are many methods to pre- 
scribe the order of set intersection operation to obtain the shortest path with the least 
transfer time between arbitrary given two nodes, its complexity is the problem of 
0(q'), with m increasing, in which q is the average number of transfer nodes on a traf- 
fic line and m might be the total number of traffic lines in CPTN. Many algorithms 
like the collection algorithm restrict the value of m so that a result can be fed back in 
a reasonable time '!. Many researchers have proposed genetic algorithms for the 
shortest path based on AI. A typical representative is the ant algorithms '!, However, 
in order to reduce search scope and to improve search speed, genetic algorithm re- 
moves some little possibility nodes by using feedback mechanism. Though it has 
some stability degree for a dynamic complex network, the accuracy of studied result 
is not very reliable because of the possibility losing optimal solution occurred. 

In order to solve the problem of search K" shortest path in CPTN with big scope 
more effectively, a new optimum multi-path search algorithm has been proposed in 
this paper by combining the A* algorithm with the deviation path algorithm“), it 
improved the efficiency of multi-path search in CPTN greatly without losing the ac- 
curacy of the result. 


2 Characteristics and a Topology Structure of Urban Public 
Transportation Network 


A CPTN of a modern city may be composed of rail transit network (CRTN) and pub- 
lic bus routes (PBR), of course, including taxis. A topology structure of PBR is nor- 
mally more complex than a topology structure of CRTN, because there are mainly 
these causations:@)The number of bus routes is greater than the number of rail transit 
line; @Bus routes have different characteristics in run way such as a symmetrical run 
way which means that the bus station names are the same in run way up and down, 
and an unsymmetrical run way which means that there are some different bus station 
names in run way up or down. So different models will be considered and built de- 
pending on the characteristics of bus run ways; @There are some different or the 
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same bus station names which are neighbored on a geographical place, and which will 
bring difficulties to build models of bus routes. It is necessary on PBR to merge these 
neighbored stations into a virtual unique station code [6] separately. 

Figure | shows the bus station names neighbored Donghua University in Shanghai 
that indicates the same geographical place, i.e., Donghua University. Figure 2 shows 
the two same bus stations in Figure | have been merged to a unique bus station. 
Figure 3 is the abstract topology structure in terms of Figure 2. 


3 Modelings of PBR and the Data Structures 


PBR can be symbolized as G (V, E, L), in which,V represents the node set of PBR, E 
represents path section collection of the bus routes (section between two bus station 
nodes), L represents the collection of bus routes (each bus route includes specific se- 
quence of nodes and sections). The radix of V is m, which means that there are m 
different nodes in PBR. The radix of L is i, which represents the amount of different 
bus routes. The up-route and down-route belonging to the same bus route are treated 
as two independent bus routes. The beginning node and the destination node of ring 
route are considered as the same node. 
Bus route Li can be considered as an orderly collection of n stops: 

Li = <S,, S5, S3, Sy... Sk | S © V,k represents the number of stops belonging to 
route Li.> 

Supposed the model as following: 


1) Considering up-route and down-route as two different routes. 

2) There are no stops which have different names but close to each other 
should be merged. 

3) Regardless of the influence of the departure frequency. 

4) Regardless of vehicle types, such as air bus general bus, etc. 

5) Regardless of bus route type, travel time is proportional to the number of 


the passed stops. 
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In order to carry out algorithm computing, many researchers abstract PBR as adjacen- 
cy matrix [4]. However, adjacency matrix is suitable for sparse network computing. 
For major cities such as Beijing and Shanghai, since adjacency matrix formed by PBR 
must be huge and dense, it should not meet the requirement of real-time computing. 
On the other hand, with the rapid development of geographic information system 
(GIS), there has been commercial GIS spatial database currently. Considered the cha- 
racters of platform independence on spatial data storage, the reliability on data query 
optimization, GIS spatial database is more appropriate for super big scale public tran- 
sit data storage. The main table structure is shown as Figure 4. 


lable Bus Routs 
Pre | Lincid fable Reute-Stap 
Linename PRFKI Lincid 
TRI | Stopstart }—_—— 
Stopend TK? Beyinstop 
Linedistance Endstop 
| Distance 
lable Bus Stops 
Pie | Stopid 
Stopnume 


Fig. 4. Database table structure. 


Bus Route table is used to store all bus routes, including line ID, line name, start 
station, terminal station, route mileage, etc. Route-Stop table is used to store informa- 
tion about stop and adjacent stop, including stop ID, adjacent stop number, stop dis- 
tance and line belongs to. Bus stops table store information of all bus stops, including 
stop ID, stop name and other attaching information. 


4 Search Algorithm Analysis 


A * algorithm is a kind of graph search strategy in artificial intelligence. It applies 
heuristic search function to evaluate the emerging branches during search process, so 
as to select the most optimal one for searching. Since it effectively reduce search 
nodes and branches, A* algorithm improves search efficiency significantly. There- 
fore, A* algorithm is better than traditional Dijkstra algorithm, particularly in big 
scale network search [4]. Traditional shortest path search algorithm based on devia- 
tion path uses classic Dijkstra algorithm to compute the optimal solution firstly, and 
then seek its deviation path set to get the Kth shortest path collection. As above men- 
tioned, when Dijkstra algorithm is applied on path search against big scale network, 
its efficiency would decline. Although the optimal solution can be guaranteed, it is 
obviously that the way of traverse all nodes does not meet the requirement of real- 
time computing. Thus, A* algorithm tries to improve it by using knowledge-based 
feedback. The basic definition is as following. 


f(v)=g(v)+h(v) 
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Among them, f (v) is defined as the heuristic evaluation function of the current node 
Vv, g (v) is the actual cost from the initial node to node v in network; h (v) is the esti- 
mated cost along the optimal path from node v to the target node. When h (v) equals 
zero, which means not using any heuristic information, A * algorithm would be 
equivalent to standard Dijkstra algorithm. 

In order to meet diversity of passenger demand actually, not only the shortest path 
query should satisfy in mass transit network, but also limited Kth shortest path should 
be provided for passenger’s selection. Because the feature of multi-path algorithm 
requests multiple shortest paths computing, its efficiency would reduce with the 
search depth increasing. Our proposal firstly obtained the shortest path by a one-time 
computing using A * algorithm, then also applied A* algorithm to seek deviation path 
collection. It is proved that the combination of A * algorithm and deviation path algo- 
rithm could better meet the search result of Kth shortest path while ensuring the 
search efficiency. The efficiency comparison of A * algorithm combined with devia- 
tion path algorithm, simple Dijkstra algorithm, and simple A * algorithm is shown in 
table 1. It shows a smaller lose of efficiency can get the accurate search results of the 
pre-K times shortest path. So the requirement of multi-path selection is well met by A 
* algorithm combined with the deviation path algorithm. 

Deviation path is defined as following, v, is the source node, v, is the destination 
node, and Vv; V2V3Vq ... V; and V,V2'V3'Vq '... V; are two paths between (vj, Va). If viVov3V4 
... Vj and v,V2'V3'v4' ... Vj ' are each corresponding to the same nodes, and get different 
nodes from v; ,;, the latter path is called that the path of v,v2'v3'vy '... V, is obtained 
deviated from the path of v,VoVv3V4 ... Vn at Vi41. It is one of deviation paths of vjvov3V4 
... Vn. The path of v;v2V3V4 ... Vn treated v;,; as deviated node. There exists a shortest 
one among all of deviation paths that deviated from node v; ,;', which consists of 
ViV2V3V4 ... Vj plus link v; v;4;' plus the shortest path from v;,;' to v,. It is defined as 
the 2th shortest path. And so on could get the Kth shortest path [5]. As shown in 
Figure 5, the path 1235 is the 2th shortest path of path 145, 12365 is the 3th shortest 
path that deviated node is 4 and deviating node is 2. 


Fig. 5. Deviation Route 


5 Algorithm Achievement 


1) Firstly, the shortest path from the initial node v, to the target node v, can be got by 
using A* algorithm. OPEN and CLOSE are two linked lists. OPEN list stores nodes 
that have been generated but not yet generate successor nodes set from. CLOSE list 
stores nodes that have generated successor nodes set from, follow these steps: 


© Initial OPEN list only contains the original node, initial CLOSE list is empty. 
Origin’s weight is zero, and the other nodes’ weight is infinity. 
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@ If OPEN list is empty, it means the search fails. Otherwise, the node with the 
smallest value f will be selected as the most festive node (named BEST) from OPEN 
list, and be moved to CLOSE list. If node BEST is the target node, turns to step 3). 
Otherwise the successor nodes of BEST will be generated based on connected path 
sections of BEST. For each successor node n, the following procedure should be 
executed. 


Calculate the weight value f to reach node n. 


If node n is equal to a node in OPEN list (named m), determines whether it has the 
lowest cost value. If it is true, replaces the cost value of m with the cost value of n, 
and points the backward pointer of m to BEST. 

If n matches a node in CLOSE list (named m’), compares whether it has the lowest 
cost value. If it is true, replaces the cost value of m’ with the cost value of n, points 
the backward pointer m’ to BEST, and moves m’ to OPEN list. 

If n is neither in OPEN list nor in CLOSE list, points the backward pointer of n to 
BEST, moves n to OPEN list, and calculates n’s evaluation function f (n) = g (n) +h 
(n). Repeat @). 

@) Backtracks from node BEST, treats the searching path found as the shortest 
path. 

2) Initializes the deviation path collection G empty. Deals with the shortest path 
node sequence V;V2V3V4 ... V, which are outputted by step 1), derives deviation path 
from deviated nodes v; (2 <i <n). Puts all deviation paths into G, and notes deviated 
node (named devNode) of each deviation path. If G is empty, terminates the search. 
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3) Sorts the paths in collection G. The shortest path is 2rd shortest path, and de- 
rives deviation path from successor nodes of devNode (deviated nodes). Note dev- 
Node of every deviation path. Puts the generated deviation path into G, and removes 
2th shortest path from G. If G is empty, terminates the search. 

4) Supposed that the K-1th shortest path has been obtained. Its deviated node is dev- 
Node. The Kth shortest path is the shortest one in G, and derives deviation path from 
successor nodes of devNode (deviated nodes). Notes devNode of every deviation path. 
Puts the generated deviation path into G, and removes the Kth shortest path from G. 

5) Do not set quit boundary. Repeats step 4). Gets all paths between (v, v,). 


The algorithm flow chart was shown in Figure 6. 
The core implement program as follows: 


private GLine DoPlan(IList<StopPoint> route 
StopPoint 0, StopPoint d) 
{ 
foreach (StopPoint p in route) 
{ 
if (p. Equals (d)) { break; } 
if (!p. Equals (o)) 
{ 
//Initialize startNode with p and 
CompassLinesHelper clhelper 
IList<CompassLines> allCompassLines = 
clhelper. GetAllCompassLines (startNode) ; 
foreach (CompassLines cline in 


allCompassLines) 
{ 
//Get adjacent point pnext and check if it 
is in shortest 
if (pnext in shortest) 
{ 
//Get the shortest path from pnext to 
destination use AStar 
} 
} 
//Get the shortest path and put into 
collection G 
} 
} 
//G is not empty 
foreach (GLine gl in DeviationPlan. G) 


{ 


//Tranverse collection G to search the 
shortest path 
} 


return kroute; 
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6 Algorithm Complexity Analysis 


Time complexity of Dijkstra algorithm is O (n’). In our improving A* algorithm, if 
the average out degree of stop is b and the search depth from the beginning to the end 
is d, then the time complexity is O(bd). The Kth shortest path is the shortest path in G 
(see Section 5). Since each deviation path computing calls A* algorithm one time, the 
time complexity of the algorithm is limited to O(e x bd) after combined with devia- 
tion path. The e is total number of routes in the network. In actual calculation, the 
number of deviation paths was far less than e [7]. 


7 Algorithm Efficiency Evaluation 


In order to compare the efficiency of three above mentioned algorithms, a database 
contains part of Shanghai bus routes was established. The execution efficiencies of 
these three algorithms are listed in Table 1 and Table 2. For comparison, actual algo- 
rithm used a variety of data structures, mainly are based on adjacency matrix and 
adjacency list. When adjacency matrix structure was used to store stops, due to big 
dimension of the matrix, the algorithm initialization required very big time expense. 
Therefore, due to the poor maintainability of this algorithm, it was not suitable for 
big-scale PBR shortest path searching. On the other hand, since the adjacency list read 
the information of adjacent stops dynamically, it was instantiated only when needed, 
and the practical efficiency of the algorithm was greatly improved. 

Figure 7 showed the efficiency curve based on the algorithm execution time in ta- 
ble 1. Coordinate X axis represented data size (the number of bus routes), Y-axis de- 
scribed algorithm execution time (in millisecond). To use Dijkstra algorithm and A * 
algorithm in 30, 60, 90, 120, 150 routes respectively, the key points in two curves 
represented computing time on the shortest path search from Taopu highway to 
Jiangning road. In accordance with the curves, it was observed directly that the effi- 
ciency of A* algorithm was higher than that of Dijkstra algorithm. In the case of the 
shortest path computing for 30 routes, Dijkstra algorithm needed 774ms, while A * 
algorithm only needed 12ms. For 60 routes, Dijkstra algorithm needed 1710ms, while 
A * algorithm only needed 15ms. From the curves, the execution time of Dijkstra 


Table 1. The shortest path algorithm efficiency comparison 


Dijkstra A* 
ae elaine Ti4ms 12ms 
pete Bee) 1710ms 15ms 
ries ahr 2345ms 15ms 
Sagemanen at 4235ms 29ms 
sass caberat 18) 5180ms 32ms 


emark: Taopu Highway---Jiangning Road(one transfer). 
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Fig. 7. The shortest path algorithm efficiency curve 


algorithm was proportional to grow with bus route number increase, while the effi- 
ciency decline of A * algorithm was not obvious. Therefore, A * algorithm was more 
efficient than Dijkstra algorithm against PBR network search. 

Figure 8 described the efficiency curve based on the algorithm execution time in 
table 2, while the axis meaning was the same as in Figure 7. It mainly reflected the 
difference against execution time on the shortest path using pure A * algorithm and 
our A * combined deviation path respectively. 


Table 2. Kth shortest path algorithm efficiency comparison (k=3) 


A* A* combined with Deviation 

30 Bus Routes 1104ms 561ms 
(Stops number:356) 

60 Bus Routes 2181ms 1990ms 
(Stops number:628) 

90 Bus Routes 3224ms 2174ms 
(Stops number:864) 

120 Bus Routes 4475ms 2430ms 
(Stops number:991) 

150 Bus Routes 5620ms 2884ms 
(Stops number:1118) 


The key points in two curves represented computing time on the 3th shortest path 
searching from Shanghai Stadium to Jingan Temple using Dijkstra algorithm and A * 
algorithm in 30, 60, 90, 120, 150 routes respectively. From two curves’ trend, the 
execution time of these two kinds of algorithm both increased rapidly during the data 
size from 30 to 60 routes, However, when data size was greater than 60 routes, the 
efficiency of pure A * algorithm decreased proportionately, the efficiency of A * al- 
gorithm combined with the deviation path had no significant drop. Thus, A * in 
combination with the deviation path shows higher efficiency when computing Kth 
shortest path from source stop to destination stop. It is more suitable for practical ap- 
plication in the path selection. 
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Kth shortest path algorithm efficiency comparison(k=3) 
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Fig. 8. Kth shortest path algorithm efficiency curve 


8 Conclusions 


Based on the BPR modeling, this paper proposed a new improved multi-path search 
algorithm. It combined both A * algorithm and deviation path algorithm. The result 
not only met the shortest path search of mass transportation network, but also ulti- 
mately obtained the Kth shortest path. Through the analysis of different shortest path 
search algorithms, our algorithm provided various decisions when ensuring the search 
efficiency. Thus, it is a better query solution on BPR path real-time selection, and has 
a good practical value. The future work is to extend to adapt more complex queries, 
such as the requirements of different routes, calculating the different costs, adding 
mix query capability against rail and bus systems. 
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Abstract. Power Quality information is not only an important part of AMI 
(Advanced Metering Infrastructure) in smart grid but also is an important part 
of technical support system of the electricity market, so the power quality moni- 
toring and evaluation is necessary of smart grid development and operation of 
power market. based on subjective weighting method (analytic hierarchy 
process, An expanded least deviations algorithm has been used to combine the 
multiple expert judgment matrixes to get the subjective weight vector in AHP) 
and objective weighting methods(variation coefficient method has been used to 
get the objective weight vector), an optimal combination determining weights 
method resulting in consistency between subjective and objective evaluation in 
the least squares sense is present. then bring out a improved fuzzy synthetic 
evaluation method based on reliability code. A new hierarchy model for power 
quality evaluation is presented to quantify and evaluate the power quality. Case 
results clearly show that the proposed method is attractive and effective. 


Keywords: smart grid; power quality; analytic hierarchy process; variation 
coefficient method; expanded least deviations algorithm; optimal combination 
weights method. 


1 Introduction 


Power Quality information is not only an important part of AMI (Advanced Metering 
Infrastructure) in smart grid and is an important part of technical support system of the 
electricity market, but also is one of the constraints in making tariff poli- 
cy[1],[2],[3],[4]. Improving the quality of power is also an important part of auxiliary 
services of electricity market.so the power quality monitoring and evaluation is neces- 
sary of smart grid development and operation of power market[5],[6]. 

The power quality evaluation method is mainly the fuzzy comprehensive evalua- 
tion method[7],[8],[9], shepard model[10], projection pursuit[11], entropy prin- 
ciple[12], and the matter-element analysis[13], attribute recognition theory[14], etc. 
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The comparative analysis is carried out based on same measured data, which shows 
there is uncertainty in the evaluation results which relay on the subjective factor influ- 
ence of evaluation process[15]. Fuzzy Comprehensive evaluation is one of the most 
widely used methods in the decision-theoretic, but it is usually be influenced signifi- 
cantly by the matrix of fuzzy relation and index vector. For a sequential segmentation 
category, the principle of the lowest cost, the principle of maximum degree of meas- 
ure and the principle of maximum degree of membership sometimes can get unrea- 
sonable conclusion, even sometimes can get error conclusion, because they conceal 
the difference of two degree of membership[16]. In this paper, A new hierarchy 
model for power quality evaluation is presented to quantify and evaluate the power 
quality, based on subjective and objective weighting methods,this paper presents a 
combinational evaluation method resulting in consistency between subjective and 
objective evaluation in the least squares sense, which make comprehensive evaluation 
result is more reasonable and credibility than single assessment method. then bring 
out a improved fuzzy synthetic evaluation method based on reliability code. The pro- 
posed method can overcome the shortages of the traditional fuzzy synthetic evalua- 
tion, Case results clearly show that the proposed method is attractive and effective. 


2 Improved Fuzzy Comprehensive Evaluation 


2.1 Optimal Algorithm for Combining Subjective and Objective Weight 
Information 


Subjective Weighting Method. In the process of the fuzzy evaluation, determining 
the proper weight is one of the most important procedures and has direct impact on 
the results of comprehensive evaluation. The judgment matrix method of the 
AHP Analytic Hierarchy Process is used in this paper. AHP is a method that 
through the analysis of complex systems and relationship between the factors con- 
tained, and then the system will be broken down into different elements, and these 
elements are incorporated into different levels, so as to form a multi-level analysis 
model objectively. According to a certain scaling theory, all the elements of each 
level will be compared so as to get the comparative scales indicating relative impor- 
tance of the elements and establish the judgment matrix. By calculating the maximum 
eigenvalue of judgment matrix and the corresponding eigenvector to get the orders of 
the elements of each level to a certain element from the upper level, and thus the 
weight vector is determined. For the detailed algorithm about AHP can get in 
[14][17]. 

For a number of experts or a group of experts involved in evaluation process, the 
following expanded least deviation algorithm[18] can be used to combine the multiple 
expert judgment matrixes to get the final weight vector. The algorithm in one hand can 
integrate the experts’ judgment information to the utmost; in another hand can reduce 
the dependence on individual expert. 

Suppose P, =[b; 


tlpep “ER Lz mis the judgment matrix group, mis the 


is the weight matrix to be solved; 


nxn 


number of the evaluation experts, P* = [a,/ O;] 
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i=l 


p={we R" 


~@, =1,a, 20,i=1,2,----- nj is weights vector set, 


-{ae R" 


Si, =1,/, 20,1) =1,2,----- m| is weighted vector set; 
i=l 


e=(L1--- 1)’ eR’. 
Suppose the Disturbance Matrix 
Qo. 
E, = [jp din = [2,13 —hon 0 = 1,200 m , Constructing the least squares model 
, Q. 


J 
as follow 


min f(@) = SAO bi <2) @eD AcE Q (1) 
2 a Q; i 


This solution to the above optimization problem is given by the Lagrange function: 


L(@.k) = W(@)+ KY @, =) =>) YA bj & 4k @,-1) (2) 


i=l i, j=l 1=1 Qa; Q; i=l 


where k is the Lagrange multipliers then making > = 0,one can obtain 
(a) 


7 


LdA6,,4 Ds 2a 0 (3) 


j=l [=I Q; 


We can obtain k=0 then @ must satisfy: 


YDAG,, 2 = bys *)=0 (4) 


j=l l=1 0; j 
0,(k) a.(k) ; 
S k b bites Se ree 
uppose 2, (@(k)) = Y dal ‘ii,l 0,(k) i @,(ky cai, n 


(5) 


The steps of using expanded least deviations algorithm for combining index weights 
are as follows: 


Stepl: give failure value ¢€ let k=0 _— give initial value at random 
(0) = (@ (0), @, (0),---,@, (0) 
Step2:compute @ (@(k)), i=1,2,------ n ,if 


9,(a@k))| <0 for all i, accept the ite- 

ration value, it shows @(k) is mineral solution of h(@) end, or wise _ turn to 

step3, 
Step3:suppose 


= max lg, then compute 


k 
BO =(LLAC 1) LAG 2 


j#s l=1 GD j#s l=1 (> 


(6) 
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g(k)o,(k), i=s 

Making vu=| (7) 
@,(k) i#s 

ak+)=9(b)/ ¥O(k) i=12.-n 8) 
j=l 


Step4: let k =k+1, back step2. 


Objective Weighting Method. Objective weighting method is based on the actual 
data via mathematical method to get the index weight. generally speaking, they are 
based on the index variation degree or the relationship to determine the importance of 
index, objective weighting method due to its absolute objectivity and may violate the 
index economic significance (or technical significance . at the same time, the 
samples change may cause weight change,and the weight is unstable. In order to 
highlight the index of relative variation and variation coefficient of variation, 
Variation Coefficient Method is be adopted in the paper and its calculation steps: 


Step1: For an evaluation matrix X = x. 


ij mxn ? 


m is evaluated object number, 7 is the 


index number. the first thing is to make indexes the same direction, then Normailize 
and make it standardized matrix Z = z, 


ij mxn * 
42%) a i=1,2,------ m j=l ” (9) 
i=l 
Step2: For standardized matrix Z + a) mn compute mean and variance of 
samples: 
— J m 
Z=-dLy , 
j 7 ij J=1,2, se Bees n (10) 
(1) 
(12) 


Step4: The weight of each index is obtained according to the variation coefficient: 
@.=s./> Vv. : 
J J 5 J el Ere 

ja J=1,2, n (13) 

Optimal Algorithm for Combining Weight Information. Using subjective weight- 
ing method to determine the weight coefficient of index can reflect policymakers 
inclinations, but evaluation result is very uncertain.using objective weighting method 
to determine the weight coefficient of index has mathematical theory basis, but it 
ignores policymakers inclinations. In order to let the decision-making process and 
result more scientific, a reasonable approach is to combine the weight informations 
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from the different weighting methods according to certain optimal algorithm.So the 
results not only can reflect the subjective and objective information, but also can 
make the unity of the value and information. Based on least-square method,a combi- 
natorial optimization model[19] to determine the index weight is established. 

Suppose the index weight vector using the subjective weighting method is 


u=[u,,u,,---,u,]', the index weight vector using the objective weighting method is 


v=[V,,V.,°°5¥, v,]', the final weight vector After optimizing is O=[,,0,,°°-,®, Vs 


n 


standardized matrix Z = z.. mis evaluated object number, n is the index number. 


ij mxn 


Constructing the least squares model as follow 


min g(@) = > flu, -@)z,F +10, -@,)z,F} 


i=l j=l 
Sin Ls ¥0;=1 Q. JH=LQe n (14) 
This optimization problem is given by the Lagrange function: 


L(@,A) = g(@)+ ay @=1) 


j=l 


= {((u,-@,)2,P +1, -@,)2,P }+44Q a, -1) (15) 
i=l j=l j=l 

? haa : OL OL 
where A is the Lagrange multipliers then making =0 and =0, can 

dQ, aa 

obtain 

-))2tu,+v,-@,)z, +44 =0 (16) 
ay o;,-1)=0 (17) 


j=l 


set A= diag Saka, seeees Sa ; E=[Lle iN F 


T 
‘ m 1 m 1 m 1 
O= [@,,0,,--,@,]" B= tag] SS FY)» Day a + MadZias satiate D5 + 
i=l i 


i=l i=l 


Using matrix form 


(19) 
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Reference [16] showed that for a sequential segmentation category, the principle of 
the lowest cost, the principle of maximum degree of measure and the principle of max- 
imum degree of membership sometimes can get unreasonable conclusion, even some- 
times can get error conclusion, because they conceal the difference of two degree of 
membership. therefore proposed the principle of reliability code. 

If evaluation categories (y,, y,,---:~ ,y,) 18 a sequential segmentation of the 


attributes space ¥, y, is the membership, here membership requires to unitary. / is 
reliability code, considering the generally range of A is 0.5<A<I , here 
A=0.6 ~ 0.7. If y,,y5,-7+:", y, Meet yp >yy>ee-- >y,x, and 


ky=min{k:Du,(,) 2 1<k< K) (20) 
l=1 


One can obtain that x, belongs to category Vho * 


If yoyo »y, Meet y,<y,<e0- <y, , and 
K 
ky =max{k: > H,,(y) A 1<k<K} (21) 


One can obtain that x; belongs to category y,, 


3 Application in the Comprehensive Evaluation of Power Quality 


The elements impacting the quality of power commodities are in large and complex, 
namely, power quality needs of a number of indicators to measure. How will the sub- 
indicators reasonable description and organization to reflect the quality of power 
together is actually a very complex multi-attribute Comprehensive evaluation and 
decision-making. 

The hierarchical model for power quality comprehensive evaluation is established 
as shown in Figure |. There are four first-level indicators of power quality; each first- 
level indicator has some second-level component indicators. 
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Fig. 1. The hierarchy model for power quality 
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Conducting Comprehensive evaluation of static indicators figured up with the data 
from the 95% probability value of the monitoring period and used SARFI (System 
Average RMS Variation Frequency Index) for transient indicators. 

Firstly, With the data provided by Electric company which recorded by monitor- 
ing of a PCC(Point of common coupling) in a distribution power network to illustrate 
the evaluation methods. Detailed data are shown in Table1. 


Table 1. Measured data of PCC of distribution network 


Sample Ju Tn Ts Tia Tis lis Tz Tis Tho Tuo b L 

1 5 2 1.67 0.89 0 4 3 0 0 1.15 0.116 2.47 
2 11 2 1.40 0.78 1 1 1 0 0 1.55 0.06 2.39 
3 13.7. 2 LAT 0.78 0 1 0 0 0 1.20 0.65 2.40 
4 73 1.1 1.56 0.85 0 2 2 0 0 1.15 0.092 3.50 
5 6 2 1.53 0.75 1 3 0 0 0 0.90 0.07 3.19 
6 3 1 0.96 0.71 0 1 0 0 0 0.94 0.06 1.61 
7 5 2 1.24 0.77 0 2 1 0 0 1.15 0.066 2.74 
8 5 2, 1.17 0.73 1 1 0 0 0 1.05 0.06 2.59 
9 4 2 2.08 0.96 5 8 0 0 0 1.87 0.12 4.28 
10 6 2, 2.17 0.97 9 10 4 0 0 2.01 0.132 4.34 


The weights of 10 second-level indicators of voltage indicators is calculated by 
AHP Methods, the scores given by four experts used and the expanded optimization 
algorithm expounded in section II .By using the same way we can get the weight 
vector of the first-level indicator) . 

In accordance with the above improved fuzzy Comprehensive evaluation method, 
10 group data of power quality is used, the Comprehensive evaluation results shows in 
Table 2, it is consistent with the conclusions provided in references 14. 


Table 2. Comprehensive evaluation results of power quality 


sample 1 2 3 4 5 6 7 8 9 10 
Results m g g m m e g g q q 


For Table 1, if using conventional fuzzy Comprehensive evaluation method, about 
more than 40% (5 group) data will obtain unreasonable even wrong evaluation results. 
For example, to evaluate sample 8th by proposed method in this paper, one can 


obtain k, =2, and therefore the power quality of this time period belong to C, cate- 


gories, promptly, the Comprehensive evaluation results of the sample is good. 
According to the conventional method of fuzzy Comprehensive evaluation of the 


evaluation results should be C,, in fact we can see that sample belong to C, or C, are 
the properties of more or less equal measure, and samples belong to C, and C, togeth- 


er equivalent to 0.67, accounting for the entire attribute more than half, so that the 
sample belongs to C, attribute category is unreasonable. 


104 W. Chen and X. Hao 


4 Conclusion 


An improved fuzzy Comprehensive evaluation method is proposed in this paper. Based 
on subjective and objective weighting methods, this paper presents a combinational 
evaluation method resulting in consistency between subjective and objective evaluation 
in the least squares sense, which make comprehensive evaluation result is more rea- 
sonable and credibility than single assessment method. then bring out a improved fuzzy 
synthetic evaluation method based on reliability code. The application in Comprehen- 
sive evaluation of the quality of the power commodities illustrates the effectiveness 
and the feasibility of the proposed method. 
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Abstract. Dynamic voltage restorer(DVR)is one of important power quality 
control equipment. It can compensate voltage sag accurately and quickly, 
which demands that voltage sags in grid should be accurately and real-time de- 
tected and reference voltage wave should be generated. in this paper a sliding 
window iterative DFT algorithm based on adaptive sampling algorithm is 
present. magnitude and phase of the Fundamental component of a sample se- 
quence of can be extracted through sliding window iterative DFT algorithm. 
under the non-synchronized sampling conditons, adaptive sampling algorithm is 
be used which can adjust automatically the sampling time. So the algorithm can 
reduce errors caused by the non-synchronous sampling. The simulation results 
show that the algorithm can effectively detect the reference instruction voltage 
and has good real-time, tracing and anti-interference performances. 


Keywords: power quality; dynamic voltage restorer; voltage sag; Discrete 
Fourier Transform; sliding-window iterative DFT; adaptive sampling algorithm. 


1 Introduction 


Power quality has become an important issue over the past several years. One of most 
important power quality issues in power system is the voltage sags, since voltage sags 
cause severe effects to end users, general, sag characteristics, e.g., stating and ending 
points, and depth, are typically determined using an RMS envelope. Dynamic voltage 
restorer (DVR) is one of important power quality control equipment. The key point of 
dynamic voltage restorer to compensate voltage sag is quick detection of voltage sag 
and accurate generation of correct reference voltage wave. At present most widely 
used method is instantaneous reactive power theory[1], wavelet transform[2] and dq 
transform[3].These algorithms get more attentions for their real-time. In this paper a 
sliding window iterative DFT algorithm based on adaptive sampling algorithm is 
present. Magnitude and phase of the base wave component of a sample sequence of 
can be extracted through sliding window iterative DFT algorithm. Under the 
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non-synchronized sampling conditons, adaptive sampling algorithm is be used which 
can adjust the sampling time automatically. So the algorithm can reduce errors caused 
by the non-synchronous sampling. Simulation results show the feasibility of the 
algorithm. 


2 Sliding Window Iterative DFT Based on Adaptive Sampling 
Algorithm 


In order to examine the quality of power and to get the reference voltage wave for 
DVR need to sample signal measured. Usually periodic signal sampling requires the 
sample is strict synchronous with measured signal. If sampling signal or measured 
signal be interference, the small changes of frequency of signal will cause spectrum 
leakage and truncation errors, then cause incorrect calculation results. Sliding-window 
iterative DFT[4],[5],[6] based on adaptive sampling algorithm[7] in present in this 
paper can effectively solve these problems. 


2.1 Sliding Window Iterative Discrete Fourier Transform 


For any finite bandwidth periodic signal x(t) Suppose the period T, N is the num- 


ber of sample, sample periodic t = T/N ,Discrete Fourier Transform expression is: 


X(kT) = A+S. A, cos(n@kT)+ B, sin(n@kt), k =0,1,2-:-, (N -1) (1) 
ae a (2) 
A,= 25 (it) cos(naiT) (3) 
n N = 
B= >) x(iT) sin(n@iT) (4) 


The calculation of expression(1)-(4) need N sample data of whole sampling period, 
which lead to large amount of calculation and time-consuming, so above expressions 
are not suitable for instantaneous voltage detection. 

Using sliding window iterative thoughts to improve expression (3) and (4): 


2, Noey= N+ 


A, = a >) x(it) cos(nair) (5) 
De aoa ’ 
BL=— ys x(iT) sin(n@iT) (6) 


ION ei 
Ny Noted that the latest sampling point, x(i7) indicate sampled data before No i the 
sampling point. The latest real-time sampling data involved in the signal detection and 
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analysis, while the corresponding eliminate the old sampled data, thereby greatly 
accelerated the rate of sampling data update and improve the system to track changes 
of load current or voltage. Fundamental signal can be calculated by the following 
formula: 


x, (kT) = A, cos(@kT) + B, sin(@kt) (7) 
2 Noow —N+1 
A=— >) x(it)cos(air) (8) 
N’ izn., 
2 Noew —N+1 
B,=— >) x(it)sin(wir) (9) 
Nien, 


When calculating formula (7)-(8), if the system sampling frequency is relatively high, 
then the Calculated Quantities will also be relatively large. In order to simplify the 
calculation process software sliding window iterative process [2] can be adopt. 

At first a full cycle of N points of sampled data need to be stored n a contiguous 
space after multiplying with the rotation factor, and then need to set up a data pointer 
to locate position of the current sampled data, when the completion of a full cycle of 
N points calculation, the corresponding data pointer should point back to the starting 
position, begin the next cycle of replacement cycle of data. Thus equation (8) and (9) 
are simplified as the calculation of a subtraction and a sum. The new calculation result 
is restored to the old data storage unit, complete the iteration process. 


Nnew -N+1 Nour =N 


>) x(it)cos(@it)= >° x(it)cos(wit)— 
i=Npew i=Noy,—l 
x(N,,,, — Nt cos[@(N,.4, —N)T]+ X(N oy 7) COSCON, 7) (10) 
Naew —N+1 Naew —N 
>) x(iz)sin(@it)= >° x(it)sin(@ir)— 
i=Nnew i=Nyew —1 
x(N,,,, —N)tsin[@(N,,,, — N)t]+x(N,,.,7) sin(@N,,.., 7) (11) 


The whole calculation process needs the whole period summation operations in the 
initialization stage. Afterwards, when a sample data enters, iterative computation will 
complete a new value iteration. In this way, the delay caused by calculation process 
greatly reduced. 

This algorithms require the detected signals must be periodic signal, at the same 
time require digital sampling signals strictly synchronous with the detected signals. 
For the actual electrical signals, this requirement is too stringent in fact to meet. The 
actual power system frequency is always changing, although the change is generally 
slow. According to the characteristics of the electric signal adaptive sampling algo- 
rithm can be adopt, this algorithm can automatically adjust the sampling time in order 
to reduce the synchronization error and increase accuracy. 
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2.2 Adaptive Sampling Algorithm 


x(t) is a continuous periodic signal, Suppose the period T If the actual sampling 
period is T,, Let x(t) go through time window of length LT, get N discrete-time 


series x(nT,), N is the number of sample. L is the number of cycles intercepted, 


Both N and L is integral number. In synchronous sampling, =N When the sig- 
S 
nal frequency changes while the sampling period is fixed, which called non- 


synchronized sampling, = # N Which means according to the original sampling 


Ss 
period to sample the sampled data is no longer ‘N’ in a signal period. So suppose the 
ideal sampling period isT,, while the actual sampling period is7,. € is the error 


between T,,and T,, T, —T,, =€. 


Suppose x, (7) an ideal sample sequence: 
Xq(n) = X(nNT5.) = x(nT, —né€) (12) 
In nT, according to Taylor series, ignoring higher-order items, there are 
Xy(n) = x(nT, )— x'(nT, )n€ (13) 
And then according to the definition of derivative 


x(nT, +NE)—x(nT,) _ x[n(T yo +e)+ Ne| —x(nT, ) 


x’ (nT; ) = 
Né Né (14) 
_ x[nT5. +ne+ Ne|—x(nT, ) 
Né 
where NT,, is the period of sequence x(n), therefore, 
x[nT oo +ne+ Ne| = x[nT., +ne+Ne+ NT, | =x|(N +n)\(Ty, + e)| (15) 
= x[(N+n)T, ] = x[(N +n)] 
I x'(nT,) = x(n+ N)— x(n) (16) 
Ne 
x(n) = x(n) etal ae ae [x(n) — x(n+ N)] (17) 
Ne N 


Equation 17 is the expression of signal adaptive sampling algorithm expression. Algo- 
rithm is simple and easy to project implementation, thus extremely suitable for real- 
time signal processing applications. Algorithm flowchart shows in Figure 1. 


A Novel Reference Wave Generating Algorithm 111 


System Initialization 


Adaptive Sampling 


Sliding cycle pointer 


| 


Get new sampled values: 


reference voltage wave 


Calculation A; . By 
| And the fundamental 


=a 


Fig. 1. Flowchart of adaptive sampling DFT algorithm 


3 Simulation 


Simulation is carried in matlab. In which the signal sampling frequency is set to 6.4k, 
that each period sample 128 points. When the signal is periodic signal using a fixed 
sampling period to sample, the simulation results shown in Figure 2. When the signal 
frequency changes from 50Hz to 51Hz at 0.1s and changes from 51Hz to 50Hz at 0.3s 
using a fixed sampling period will get incorrect results. 


i i 
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(a) measured signal 


0.05 o1 0.15 02 0.25 03 0.35 a4 
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(c) Reference voltage wave 


Fig. 2. Simulation results under non-synchronous sample 
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Fig. 3. Simulation results of voltage sag 
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Fig. 4. Simulation results of voltage swell 


Adaptive sampling simulation results shows in Figure 3 and 4. The system has a 
good tracking performance. when the signal is periodic signal, no need to use adaptive 
sampling algorithm, but when the signal frequency changes, if do not use adaptive 
sampling algorithm, then the frequency change, the system ran at a fixed sampling 
interval according to the original sampling, this will get the wrong result, but after 
adapting the adaptive sampling algorithm, the system can automatically adjust the 
actual sampling time. This can avoid the problems of DFT in the non-synchronous 
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sampling. The complexity of the algorithm does not increase in number, namely, the 
real-time system can still be guaranteed. 


4 Conclusion 


A new sliding window iterative DFT algorithm based on adaptive sampling algorithm 
is present in this paper. The algorithm can significantly reduces errors of DFT in the 
non-synchronous sampling caused through automatically adjusts the sampling time. 
The algorithm can effectively detect the reference instruction voltage and has good 
real-time, Simulation results show the feasibility of the algorithm. 
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Abstract. Sensor networks are battery powered, and they are generally dep- 
loyed in the place which is hard to access to and environmentally harsh. The re- 
placement of battery is not feasible, so the energy constraint is one of the fea- 
tures of sensor networks. It is essential in sensor networks design to make the 
best of the limited power of the sensor nodes. This paper is to make some ex- 
ploration and research about the related information aggregation assessment 
based on sensor networks. 


Keywords: Sensor networks, Data collection, Data redundancy. 


1 Introduction 


As the rapid development of the embedded system, wireless communication, network 
and micro-electro-mechanical system, the sensor network with its sensing, computing 
and wireless communication capabilities and wireless sensor network which is consti- 
tuted by it aroused great attention. Wireless sensor network is a network made by self- 
organizing of many sensor nodes with the wireless communication technology. It 
integrates three great technologies: sensors, micro-electromechanical system and 
network, aiming to sense, collect and process the information of perception objects 
within the network coverage and then forward to the observer. It is a system centered 
in data processing [1]. Wireless sensor network is a special wireless communication 
network with an enormous number of nodes and cheap cost. So it allows fast deploy- 
ment without relying on any fixed facilities. And it can collect information on time, 
accurately and comprehensively in many occasions. Thereby it changes the interact 
style between human and nature and is considered as one of the most important tech- 
nologies in the 21st century. Nearly most nodes are battery powered, usually can’t 
supplement energy, so energy-saving is one of the most crucial objectives in wireless 
sensor design. 


2 The Analysis and Exploration of Information Aggregation 
Assessment Based on Sensor Networks 


Sensor is the key component to collect and process data. It can map a physical quanti- 
ty in the physical world to a quantitative measurement to help people to form 
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quantization cognition of the physical world. The general structure of the sensor in- 
cludes the sensor module, the information processing module, wireless communica- 
tion module and power supply modules. Sensor perception module mainly senses and 
acquires the external information and converts it into digital signals. Information 
processing module processes and keeps the data collecting by the sensor node itself 
and other nodes, controls the perception component and the power work mode etc. 
Wireless communication module communicates with other sensor nodes. In the 
process, the electricity provides the necessary power for work [2]. 

Wireless ad hoc network is a multi-hop, peer-to-peer and mobile network made up 
of dozens, even hundreds of nodes. It adopts the wireless communication, and is dy- 
namic in networking. The aim of it is to transmit the multimedia information flow 
requiring service quality by means of the dynamic routing and the technology of mo- 
bile management. In addition to common characteristics of mobility, self- 
organization, finiteness of the power to the wireless ad hoc network, wireless sensor 
network also has distinctive features. For example, the topology of the network is 
easy to change, resources of nodes are very limited, and the area that can be moni- 
tored is wide. Specific features are as follows: The communication capability is 
limited, and the transmission rate of sensor nodes is low. In most cases, the communi- 
cation distance is only several dozen or several hundred meters. As the nodes are 
often working in areas with appalling conditions, they are more exposed to the impact 
of terrains and landforms like high mountains, buildings, obstacles and of the natural 
environment such as wind, rain, thunder, lightning, humidity, and flooding, resulting 
in the unreliability of the communication between sensor nodes, and on the other hand 
a long-time bug or even damage to the nodes. Computing power is limited, so sensor 
nodes generally use embedded processors and memories. These nodes have the com- 
puting capability to complete certain processing work of information, but due to the 
limited capability and capacity of embedded processors and memories, the nodes' 
processing capability therefore is very limited. With limited power energy and a small 
size, a sensor node usually carries a very limited battery power. Since the number of 
sensor nodes, who have a wide range of distribution and a complex environment to be 
deployed, is large, and sensor nodes require low and cheap costs, during the using 
process, batteries thus can not be recharged or replaced. If the energy of batteries is 
used up, nodes will lose their effect. Therefore, the constraint of the energy is a se- 
rious problem that hinders the application of a wireless sensor network. Sensors con- 
sume more electrical energy in transmitting information than making calculations, 
hence in the working process of the network, we need to save energy, and maximize 
the life cycle of the network. The size of the network is large, and the topology is very 
complex. Besides, nodes in a sensor network are intensive. Its large quantity may be 
hundreds or even tens of millions. In addition, sensor network can be distributed in a 
wide geographical area, and its range of perception is also very large. Moreover, those 
three elements in a sensor network, sensor nodes, the perceptual objects, and the ob- 
server, are all possible to move, and it is quite often that new nodes will join or the 
existing nodes may fail. Therefore, along with the constant changes in the network's 
topology, the path between the n nodes of a sensor and the observer changes at the 
same time. Sensor network has the nature of self-organization. In the application of a 
sensor network, sensor nodes usually can not be accurately pre-set, and the relation- 
ship between adjacent nodes is also unknown in advance, such as scattering a large 
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number of sensor nodes by aircraft to the vast primeval forests, or randomly placing 
them in dangerous areas where people are unable to reach. This requires sensor nodes' 
capability of self-organizing, meaning that they can automatically configure and man- 
age, and through the topology control mechanism and the network protocol sensors 
can automatically form a multi-hop wireless network system which forwards the mon- 
itoring data. Sensor nodes will lose effectiveness due to the energy depletion or envi- 
ronmental factors, and there are also some nodes who are added to the network in 
order to make up for the failure nodes, thus making the network topology dynamically 
change with it Self-organization of the sensor network should be able to adapt to this 
dynamic change. Data transmission's directionality is also very strong. In a wireless 
sensor network, data transmission is highly directional. Typically, the inquiry infor- 
mation is transmitted from the observer to the sensor nodes within the network by 
way of broadcasting or multicast, while information of the detection results clusters 
with the help of sensor nodes distributed everywhere into the query nodes. Fig. 1 
shows the sensor network architecture. 


Sink 


Sensor 
Field Sensor 
Node 


Fig. 1. WSN architecture. 


Wireless sensor network is a ubiquitous sensing technology, which allows users to 
more deeply understand and grasp the world around them. Having very broad applica- 
tion prospects, the wireless sensor network, with a high using value, can be applied to 
military affairs, environmental monitoring and forecasting, health care, smart home, 
condition monitoring, space detection, warehouse management and safety monitoring 
in large areas. Field of military affairs: wireless sensor network is rather widely used 
in military affairs, such as monitoring the status of the battlefield, acting as a guidance 
device of smart weapons, detecting and determining attacks of nuclear, biological and 
chemical weapons, installing sensors in people, equipment, and arms of the military 
armies for recognition, and throwing sensors to the enemy's positions to detect intelli- 
gence, and the like. The U.S. Defense Department and all the military departments 
attach great importance to the sensor network. They brought forward the C4KSIR 
plan based on C4ISR, emphasizing the capability of perceiving the intelligence from 
the battlefield, and of synthesizing and using information. Taking sensor network as 
an important area of research, military departments set up a series of research projects 
on military sensor networks. Field of environmental protection: as people become 
increasingly concerned about the environment, environmental science involves a 
increasingly larger scope. Wireless sensor network provides convenience for the ac- 
quisition of random data for research in the field, for example, monitoring compo- 
nents of oceans, the atmosphere and soil, studying the impact of environmental 
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changes on crops, determining the rainfall conditions, offering accurate information to 
prevent floods and drought, real-time monitoring of air pollution, water pollution and 
soil contamination, and the monitoring of forest fires. Field of medical treatment and 
health: embedding network sensors in shoes, furniture, household appliances and 
other devices can help the aged, people who're seriously ill and the disabled with their 
family life. With the use of sensor networks, necessary information can be delivered 
efficiently, thus facilitating people's access to nursing, reducing the burden on the 
nursing staff and improving the quality of nursing. Using sensor networks can also 
collect people's physiological data for a long time, and accelerate the development 
process of new drugs, while micro sensors installed in the monitored objects will not 
bring inconvenience to people's normal life. In short, sensor networks equip the future 
telemedicine with a convenient and efficient means of technology. Field of space 
exploration: by means of setting sensors nodes in the objects of the outer space, where 
humans are inaccessible at present or can not work for a long time, a long period of 
monitoring of those objects can be achieved. Through the analysis of data returned by 
these sensor nodes, we may have a better understanding of those objects. Other appli- 
cation fields: self-organization, miniaturization and the perception of the outside 
world are the three major characteristics of the wireless sensor network, which deter- 
mine that it is possible for the wireless sensor network to have not a few opportunities 
in other fields. For instance, sensor network can combine with PDA thus forming a 
personal server, which is able to obtain the real-time information of the network and 
filter all the related information by visiting information points so as to provide people 
with the needed information [3]. Apart from that, in many fields like the rescue during 
disasters, warehouse management, interactive museums, interactive toys, automatic 
production lines in factories, wireless sensor network will breed a totally new design 
and mode of application. 

Information aggregation plays a very important role in wireless sensor networks, 
mainly reflected in saving the energy of the entire network, enhancing the accuracy of 
the collected data and improving the efficiency of data collecting. Wireless sensor 
network is constituted of a large number of sensor nodes covering to the monitoring 
area. In the deployment, sensor nodes need to reach a certain density in order to en- 
hance the robustness of the whole network and the accuracy of the monitoring infor- 
mation. Sometimes the monitoring scopes of several nodes have to overlap. And it 
will lead to a certain degree of redundancy among the information reported by neigh- 
boring nodes [4]. Information aggregation is to deal with the superfluous data within 
the network by integrating the data and removing the redundant information to mi- 
nimize the quantity of data conveying under the promise of satisfying the application 
needs. 

There is no unifying method in information aggregation, it needs related computa- 
tion method depending on different application backgrounds, such as the adaptive 
weighted aggregation algorithm. Suppose there are ” sensors measuring one certain 


environment object. Firstly, make a data check of xi(i=2,---n—1), the testing 
criteria is that the difference between the adjacent numbers should be no more than 
the given threshold € , that is to say, [xi +1— xi <€,(i=2,---n—-1), € is based on 


the accuracy degree of the sensor measuring. Suppose the measurement data of n 
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sensors meets [xi +1— xi <é, (i =2,---n—- 1) . Then let's conduct an estimate of the 


adaptive weighted fusion for these measurements. The basic idea of the adaptive 
weighted fusion algorithm is: with the optimal condition that the overall mean square 
error is the minimum, according to measurements provided by various sensors, find 
the corresponding optimal weighting factor of each sensor in the way of self-adaption 


so as to make the result of x after fusion become optimal. Suppose variances of n 


Dip 
sensors respectively are (> (i =1,2,---7) ; the truth value needing to be estimated 
is X , and the measurement value of each sensor is X1,.X2,-°+: Xn, which are indepen- 
dent of each other and are the unbiased estimates of x . The weighting factor of each 


sensor is Wi(i =1,2,---), so the truth value of x after fusion and the weighting 


= n n 
factor satisfy: x = >: Wiki , ¥ Wi = 1.The overall mean square error is 
i=l i=l 


o* =£|(x-x) |=e wrx)? +2 y ww (x — x; )(x—x,) 


i=l, j=Li#j 


Since *1,.%2,°**Xn are independent of each other, and they are the unbiased estimates 


of x, thus E] (x—x,(x—x,)]=0, @# i512. 7=12--,n), 


— 2 — 22 
Therefore, O° = E > w, (x-x,)° => w, O, - Assuming £ stands for 
i=l 


i=l 
the set of sensor nodes monitoring events: E= 112i O50 ey xX ; Stands for the 
random variables of the monitoring events set in node 1, X ; gets its value in the 
sample space ), Q stands for the set of the possible monitoring events of any nodes 
in the sensor network, is the power set of — , namely: Q = {wlwe W(E)} . Sup- 
pose EF, stands for the monitoring events set of sensor node i, A, stands for the 


monitoring scope of node 1, O stands for the number of the events in per unit area, 


namely: |Z, a OA, . Assuming K data packets aggregate to one data packet, every 
data packet received by the sink node is Lbit , changes into L bit after aggrega- 
tion. Then we can calculate the combination entropy of the aggregation entropy, that 


is to say, the data H f contained in the mixed data, then: 


k 

A,= H(X,,X,.°°°,X,) = Lé, . Based on the above assumptions, let’s design 
i=l 

an assessment of information aggregation as the ratio of joint entropy of aggregation 
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data in the aggregation nodes and aggregation data bits, namely H f / L fo If the ratio 


of the input and output in the aggregation algorithm is a constant value, then the in- 


k 

formation aggregation can be defined as r= L f i » L, . Based on the above de- 
i=l 

fined information aggregation degree, the assessment of information aggregation can 


k k 
be further defined as H , / l= DE, ome . Consequently, the greater the 
i=l i=l 
aggregation data, the smaller the assessment result. The visual effect of information 
aggregation is to reduce the amount of data transmission, thus save energy. The best 
polymerization condition is that the intermediate nodes can merge nN equal-length 


input data groups into one output group, its energy-saving efficiency is (n —1) / n;in 
the worst case, the aggregation operation does not reduce the data amount, but reduce 
lots of operations in consulting and channel contention by decreasing the number of 


groups. Thus, it decreases the overall overheads of the transmission unit, meanwhile, 
save the energy [5]. 


3 Conclusion 


There are a great amount of nodes densely distributed in the wireless sensor network, 
and the network will produce a large number of data. If we do not apply the informa- 
tion aggregation technology, there will be a great deal of excessive information which 
will consume a lot of energy. Meanwhile, the high probability of conflict will lead to 
the failure of sending and receiving of information. And the retransmission of infor- 
mation will consume much energy and increase the delay. In a word, the information 
aggregation technology is essential and necessary. 
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Abstract. This paper presents an efficient MEMS gyro aided automatic calibra- 
tion algorithm for three-axis magnetic compass. This electronic compass mod- 
ule consists of a three-axis magnetic sensor, a two-axis inclinometer and a 
MEMS gyro. The magnetic electronic compass is used to determine the heading 
of a indoor mobile robot with respect to the magnetic North. The automatic 
calibration method requires the mobile robot to make three full 360-degree rota- 
tion. In this rotation procedure, magnetic field data, attitude data and angular 
rate data are recorded. According to magnetic field data and attitude data, raw 
heading data is calculated. This raw heading data is verified by angular rate data 
from MEMS gyro. Results of experiment show that the accuracy of calibrated 
compass is better that 1 deg and MEMS gyro aided automatic calibration algo- 
rithm is effective for electronic compass. 


Keywords: magnetic electronic compass, MEMS gyro, calibration, mobile 
robot, heading. 


1 Introduction 


The Earth’s magnetic field intensity is about 20 to 50 A/m and has a component paral- 
lel to the Earth’s surface that always point toward magnetic north [1-4]. This is the 
basis for all magnetic compasses. Magnetic compass has been used in navigation for 
centuries. Today, advances in technology have led to the solid state electronic com- 
pass based on MR magnetic sensors and acceleration based tilt sensors. Electronic 
compasses offer many advantages over conventional “needle” type or gimbaled com- 
passes such as: shock and vibration resistance, electronic compensation for stray field 
effects, and direct interface to electronic navigation systems. 

When a compass is operating in a open area in the absence of any ferrous metals 
there is no distortion effects on the earth’s magnetic field. In reality, though, com- 
passes are mounted in vehicles, aircraft, and platforms that most likely have ferrous 
materials nearby. The effects of ferrous metals (iron, nickel, steel, cobalt) will distort, 
or bend, the earth’s field which will alter the compass heading. These effects can be 
thought of as a magnetic field that is added to the earth’s field. This will introduce 
heading measurement error to compass. 
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In this paper, an efficient gyro aided compass calibration algorithm is proposed. 
The magnetic electronic compass is used to determine the heading of a indoor mobile 
robot with respect to the magnetic North. The automatic calibration method requires 
the mobile robot to make three full 360-degree rotation. In this rotation procedure, 
magnetic field data, attitude data and angular rate data are recorded. According to 
magnetic field data and attitude data, raw heading data is calculated. This raw head- 
ing data is verified by angular rate data from MEMS gyro. Results of experiment 
show that this algorithm can be used to realize heading calibration for magnetic elec- 
tronic compass. 


2 Anistropic Magnetoresistive Sensor and Its Application to 
Electronic Compass 


The anisotropic magnetoresistive (AMR) sensor is one type that lends itself well to 
the earth’s field sensing range. AMR sensors can sense DC static fields as well as the 
strength and direction of the field. This sensor is made of a nickel-iron (Permalloy) 
thin film deposited on a silicon wafer and is patterned as a resistive strap. The proper- 
ties of the AMR thin film cause it to change resistance in the presence of a magnetic 
field. Typically, four of these resistors are connected in a Wheatstone bridge configu- 
ration so that both magnitude and direction of a field along a single axis can be meas- 
ured. For typical AMR sensors, the bandwidth is in the 1-5 MHz range. The reaction 
of the magnetoresistive effect is very fast and not limited by coils or oscillating 
frequencies. 

The electrical output of AMR sensor is proportional to the magnetic field strength 
along its sensitive axis. When a AMR sensor is spun around a horizontal plane start- 
ing from magnetic north, the output is a cosine function of the heading angle. A 
minimum of two sensors that are arranged mutually perpendicular would eliminate 
the ambiguity in electrical output with respect to heading direction as seen in Fig. 1. 

For a two-axis compass without tilt compensation, azimuth or the heading is cal- 
culated by the equations given below [1-8] 


A= arcTan{ 2) (1) 
Xx 


The X sensor defines the forward direction and the Y sensor is to the right. 

This two-axis compass will perform well as long as it is kept horizontal and is use- 
ful in hand held applications. However, operation of the compass while it is not level 
can result in considerable amount of heading error. 


2.1 Electronic Compass with Tilt-Induced-Error Compensation 


Most often compasses are not confined to a flat and level. If the compass were tilted, 
the tilt angles (pitch and roll) and three magnetic field components must be used to 
calculate heading [1-8]. An inclinometer, or tilt sensor, should be used to determine 
the roll and pitch angles. The terms roll and pitch are commonly used: ROLL refers to 
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the rotation around the X-axis, or forward direction, and PITCH refers to the rotation 
around the Y-axis, or left-right direction. 

The general magnetic compass module with tilt-induced-error compensation con- 
sists of a three-axis magnetic compass and a two-axis inclinometer to calculate tilt- 
compensated azimuth information [1-8]. Fig. 1 shows an functional block diagram of 
a three-axis compass with tilt compensation. 


Pitch 


2 -Axis 
Tilt Sensor 
Analog to 
Digital 


; uProcessor 
3-Axis 


Magnetic 


Sensor 


Fig. 1. Functional block diagram of a three-axis compass with tilt compensation 


The compass must now rely on all three magnetic axes (X ,Y ,Z) so that the 
earth’s field can be fully rotated back to a horizontal orientation. In Fig. 2, a compass 
is shown with roll and pitch tilt angles referenced to the right and forward level direc- 


tions of the observer or vehicle. The X ,Y and Z magnetic field data can be trans- 
formed back to the horizontal plane (X,,, Y,,) by applying the rotational equations 
shown below: 

Roll compensation. 


According to the roll angle, roll compensation is done by applying the rotational 
equations shown below. 


Xie =X 
Y., = Ycos(@)+ Zsin(@) (2) 
Z,, =—Ysin(@)+ Zcos(@) 


where @ is roll angle. 


Horizontal and vertical magnetic components calculation. 

After roll compensation, according to the pitch angle, the X , Y and Z magnetic com- 
ponents can be transformed back to the horizontal plane and vertical direction by 
applying the rotational equations shown below. 
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X,, = X,,cos(g)- Z,,sin(@) 
y, =i, (3) 
Z, = X,, sin(g)+Z,, cos(9) 


where @ is pitch angle. Horizontal and vertical magnetic components are gotten. 


Once X , and Y, » i the horizontal plane are known, (1) can be used to determine 


the azimuth. 


2.2. Hard Iron and Soft Iron Interference Compensation for Electronic 
Compass 


When a compass is operating in a open area in the absence of any ferrous metals there 
is no distortion effects on the earth’s magnetic field. In reality, though, compasses are 
mounted in vehicles, aircraft, and platforms that most likely have ferrous materials 
nearby. The effects of ferrous metals (iron, nickel, steel, cobalt) will distort, or bend, 
the earth’s field which will alter the compass heading. 

When the compass is mounted on the mobile robot, the effect of the robot body 
would distort the Earth’s magnetic field. Let the robot rotate in a circle would produce 
the curves shown in Fig. 2. 
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Fig. 2. Hard iron and soft iron interference of electronic compass (Unit: nT) 


The X,Y plot is not a circle (slightly ellipsoid) and there is offset from the (0, 0) 
point. This offset and ellipsoid effect are a result of the fixed distortion of ferrous 
metals on the earth’s magnetic field. 
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To compensate for the magnetic field distortion, two scale factors for X,, and 
Y,, can be determined to change the ellipsoid response to a circle. Offset values X (y 


and Y., can then be calculated to center the circle around the 0,0 origin. 
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where Xypax> Yuax> Xvi and Yay are maximum and minimum values of 


X,, and Y, respectively. 


3 MEMS Gyro Aided Automatic Compass Calibration 


Traditionally, gyroscopes were mechanical devices that measured the angular rate of 
rotation. MEMS (microelectromechanicalsystem)-gyroscope technology now pro- 
vides this function in a variety of packages that enable integration into PCB (printed- 
circuit-board)-based systems. MEMS gyroscopes employ tiny micromechanical 
systems on silicon structures, supporting motion-to-electrical transducer functions. 

An ideal MEMS gyroscope produces a predictable output when you subject it to a 
known rate of rotation. It has no noise, perfect linearity, and no offset. Perfection is 
not typically available on economical manufacturing processes, however, creating the 
need for a deeper understanding of MEMS-gyroscope errors. A yaw-rate MEMS 
gyroscope responds to motion about its predefined axis of rotation. The following 
equation provides a simple linear behavior model for a MEMS gyroscope: 


Gyro = KO gare a pias + noise (5) 


where Qeyro 18 output of MEMS-gyroscope, @p47, is the angular rate of rotation 
tested, Mg,,5 18 the offset of MEMS-gyroscope, @yojs- 18 the noise of MEMS- 


gyroscope, K is the scale factor of MEMS-gyroscope. 

MEMS angular rate gyroscopes do not measure angular displacement directly but 
rather the rate of angular motion. However, mathematical integration of angular rate 
with respect to time produces a relative angular displacement. It can track changes in 
the heading reliably in the short time. Errors in the mathematical integration accumu- 
late with time due to quantization effects, scale factor and bias changes, and other 
error sources in the signal output of the gyro. 
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In order to calibrate the electronic compass, first place mobile robot with heading 
O°in a level plane and let it be stationary, mathematical integration of angular rate 
output of gyro with respect to a fixed time interval produces a quantity proportional 
to the bias error. According to this quantity, the bias error of gyro can be calculated. 
The pitch angle (@) and roll angle (@) can be calculated according to the output of 


2-axis tilt sensor. 

Then let mobile robot make three full 360-degree rotation. In this rotation proce- 
dure, magnetic field data, attitude data and angular rate data are recorded. Because 
mobile robot is in level plane, pitch and roll angle are unchanged. According to real 
time magnetic field data, pitch and roll data in previous step, raw heading data is 
calculated. On the same time, mathematical integration of angular rate output of gyro 
with respect to time produces a relative angular displacement. Because mobile robot 
is of heading 0° at the beginning, this relative angular displacement can be used to 
induce true value of heading. 


Acyror Uf OS Agyro < 360 deg 
Onpup = Ooyro — N X360, if NX360 (6) 
< Acyro <(N +1)x360 deg 


where N is a integer. 
According to true value and raw data of heading angle, electronic compass calibra- 
tion can be achieved. 


4 Experiments 


The electronic compass is implemented using Honeywell’s Anisotropic Magneto- 
Resistive (AMR) sensor microcircuits HMC1022/HMC1021S, ADI’s accelerometer 
sensor ADXL203. In order to eliminate the influence of bridge offset, the AMR sen- 
sor is flipped periodically at 100 Hz with SET/RESET pulse. The output signal of 
each sensor is amplified by precise instrumentation amplifier INA118 and then 
converted into digital quantities. MEMS gyro ADXRS150 is used for automatic cali- 
bration of electronic compass. 16-Bit, 6-Channel Simultaneous Sampling Analog-to- 
Digital Converter ADS8364 is used for analog signal sampling. Microcontroller 
C8051F120 is used for data processing. The electronic compass is connected to com- 
puter through RS-232. 

At first, automatic calibration of electronic compass is made. Place the mobile ro- 
bot in level plane, and let it to make three full 360-degree rotation at speed of 10 
deg/s. According to recorded magnetic field data and attitude data, raw heading data 
is calculated. This raw heading data is verified by the mathematical integration of 
angular rate data from MEMS gyro. Calibration data is shown in Table 1. And then, 
let mobile robot rotate one full revolution slowly, raw heading is calculated according 
to magnetic field data and attitude data. Based on these calibration data, linear inter- 
polation method is chosen to compensate the error of heading. Test result is shown in 
Table 2. This show that the accuracy of calibrated compass is better that 1 deg and 
MEMS gyro aided automatic calibration algorithm is effective for electronic compass. 
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Table 1. Electronic Compass Calibration Data (Unit: degree) 


Heading Heading Heading Heading 
(True value) (Raw) (True value) (Raw) 
0.0 9.3 180.0 189.4 
10.0 18.5 190.0 198.4 
20.0 26.6 200.0 206.7 
30.0 34.4 210.0 214.4 
40.0 41.7 220.0 221.5 
50.0 48.9 230.0 228.8 
60.0 56.4 240.0 236.5 
70.0 64.1 250.0 244.2 
80.0 72.2 260.0 252.3 
90.0 81.4 270.0 261.4 
100.0 91.5 280.0 271.6 
110.0 102.9 290.0 282.8 
120.0 115.4 300.0 295.3 
130.0 128.9 310.0 308.7 
140.0 142.5 320.0 322.7 
150.0 155.6 330.0 335.9 
160.0 168.5 340.0 348.2 
170.0 179.4 350.0 359.4 


Table 2. Test Result of Electronic Compass (Unit: degree) 


Heading Heading Heading Heading 
(True value) Error (True value) Error 
0.0 0.2 180.0 0.4 
15.0 0.4 195.0 -0.4 
30.0 -0.3 210.0 -0.3 
45.0 0.5 225.0 0.5 
60.0 -0.2 240.0 -0.3 
75.0 -0.6 255.0 -0.7 
90.0 0.1 270.0 0.5 
105.0 0.3 285.0 0.4 
120.0 -0.2 300.0 -0.3 
135.0 0.7 315.0 -0.7 
150.0 0.4 330.0 0.2 
165.0 -0.5 345.0 0.8 


5 Conclusion 


In this paper an efficient calibration algorithm for three-axis magnetic compass is 
presented. The automatic calibration method requires the mobile robot to make three 
full 360-degree rotation. In this rotation procedure, magnetic field data, attitude data 
and angular rate data are recorded. According to magnetic field data and attitude data, 
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raw heading data is calculated. This raw heading data is verified by angular rate data 
from MEMS gyro. Results of experiment show that the accuracy of calibrated com- 
pass is better that 1 deg and MEMS gyro aided automatic calibration algorithm is 
effective for electronic compass. 
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Abstract. The simulation method of three-phase voltage source rectifier(VSR) 
using the Space Vector pulse width modulation (SVPWM) and feedforward de- 
coupling control strategy is studied in PSCAD. Firstly, the mathematical model 
of VSR under the rotating coordinates axis was set up; then discussed the reali- 
zation of SVPWM in PSCAD, and designed a decoupling control system of 
VSR; lastly, compared the performance of SVPWM and SPWM rectifier, the 
simulation results proved the dual loop control strategy of VSR is correct, the 
SVPWM tectifier has the advantages of less Harmonic content ,easier digital 
realization. 


Keywords: three-phase voltage source rectifier; space vector PWM; feedfor- 
ward decoupling; PSCAD. 


1 Introduction 


The traditional rectify method usually uses phase-controlled rectify or uncontrollable 
rectify, they have the shortcomings of slow dynamic response, high harmonic content 
in power side. Three-phase voltage type pulse-width modulation(PWM) rectifier has 
the advantages of higher power factor, lower harmonic content ,are mainly used in 
medium-power transform circuit, act as the direct current(DC) power of inverter cir- 
cuits or uninterruptible power supply (UPS), it’s research focus of green power reali- 
zation and harmonic contamination elimination in recent years. 

This paper first discusses the feedforward decoupling dual close-loop control strat- 
egy based on three-phase voltage rectifier, and then elaborates the implement of space 
vector pulse width modulation (SVPWM) method in the simulation software PSCAD 
environment, lastly, simulation study of the VSR control based on sinusoidal pulse 
width modulation (SPWM) and SVPWM modulation is performed, and the corres- 
ponding simulation waveform is given. 


2 Control Strategy of VSR 


2.1 Mathematical Model under Three-Phase Stationary Coordinate System 


The voltage-source type PWM rectifier is show in Fig.1.Here, e, represents the 


source voltage and j, represents the input current, x=a, b, c. C is the capacity of the 
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DC-side filter capacitance, R and L denotes the resistance and inductance respective- 
ly, izgaq 18 the load current of DC side, yg, is the DC bus voltage, Ry is the load. 


Lia 

Wa gg t RS Ve dl 

R é ™N ZN N ™ Uae 

é a . 
Co Wor a db i) Gl pr. 
Ore WY) >b G 
L@jleVW\v = mC 

Va x Vo Va 


Fig. 1. Main circuit Three-phase voltage-type PWM rectifier 


Define switching function of three-phase rectifier bridge as: 


( upper is ON and lower is OFF 
k — 


(k = a,b,c) (1) 
0 upper is OFF and lower is ON 


Considering the displayed state variables on the circuit of Fig.l, neglecting switch 
delays, on-state semiconductor voltage drops, snubber networks, and applying Kir- 
chhoff laws, Voltage equation of phase A can be obtained[1]: 


L& + Rig= ea~ (wan + UNO) (2) 


When Vj is on and Vg is off, switching function §,=1, ugn =ude» When V4q is 


onand Vj is off, §,=90, ugy =9.for ugn =udcS q > 80 (2) can be rewrote as: 

eq= LE + Rig t (udeSat UNO) (3) 
Similarly the availability of phase B and phase C equation as follows: 

ep= Lt Rin + (udeSb* uno) (4) 


ec = LE + Ric t+ (udeSc+ uno) ©) 


For the three-phase three-wire balanced system, the following equations are 
established: 


e +e,te = 0 
fora. (6) 
iqgttptt, =0 
derive from(2)- (6): 
SatSptsS 
uNO =-——~—£ ude (7) 


3 


A Simulation Study of Space Vector PWM Rectifier Based on PSCAD 131 


In addition, using Kirchhoff's current law on DC bus capacitor positive node: 


Cte = igSat ivSb+ iS itoad ' 


2.2 Mathematical Model under Synchronous Rotating Coordinate System 


Reference coordinate system is shown in Fig.2, and using the Park transform, derive 
from(2)-(8), we can get mathematical model of VSR under dq coordinate system[2]: 


dia 5 ; 
ed= LEA + Rig— OLigt ud 
dig : F 
eq= Lo +Rigt OLigt+ug (9) 


dus 3. ; ; 
CfG 7 9 asd + igsq) ~ iload 


Here eg, e q and jg, i q denote the input voltages and the input currents in the syn- 
chronous reference frame, Where jg is active current component, ig is reactive 


current component, The direction of d-axis orientate according to the direction of 
voltage vector E ,so eg=E, eg= 0. 


The input of active power and reactive power of grid side are[3]: 


3 3 
e = 5 (edia + egiq) = 5 edid 
(10) 


3 3 
= 5 (egid — edig) = 5 edig 


As can be seen from the above analysis, after coordinate transformation, input active 
and reactive power of the rectifier have been decoupled, the voltage and current are 
DC components in the d-q axes, the coordinate transformation provides the conditions 
for high performance control. 


ala) 


Fig. 2. Reference coordinate systems 
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2.3 Dual Loop Control Strategy of VSR 


Use the feedforward decoupling control strategy, the current regulator with PI regula- 
tor, ug> u q Equations as follows: 


Kj : F 
ud =—(Kipt+ Ay" -i,) + OLigt+ed 
S 
(11) 
= Kily,*_. ; 
Ug=—(Kip ae gt) —OLigteg 


When the rectifier operates in unity power factor state, the input of reactive power of 
grid side is zero, i.e. j = 0, By controlling the d-axis current to control the active 


power ,and controlling the q-axis current to control the active power. A control sys- 
tem with dual close-loops of voltage and current was designed based on the mathe- 
matical model of three-phase PWM rectifier, the control structure of the system is 
shown in Fig.3[4]. 
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Fig. 3. Dual close-loop control scheme 


3 SVPWM Control of Three-Phase VSR 


3.1 Theory of SVPWM 


The power electronic converter can operate only eight distinct topologies. Six out of 
these eight topologies produce a nonzero output voltage are known as active vectors 
(U1- U6) and the remaining two topologies produce zero output voltage are known as 
zero vectors (U0, U7). The six active vectors numbered I to VI in a stationary refer- 
ence frame as shown in Fig. 4. 

The active vectors U1 to U6 are obtained as follows: [5] 


Up= Su jered FAA/3) (12) 


Here, k =I, 2,...,6. 
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4, 


U3(010) | U2(110) 


Us(001l) | U6(101) 


Fig. 4. Representation of the inverter states in the stationary reference frame 


Let the desired voltages in a stationary (@— axis) reference frame are U”. 


Then, the vector, magnitude and angle of desired voltage can be given as follows: 


elo (13) 


u*=|v" 


o* = tan [wg/widl: |U"|= fue? + ui (14) 


From Fig. 4 finds that, assuming U™ to be lying in sector k , the adjacent active vec- 


tors are U; and U;z41. To achieve the desired stator voltage U* within the sam- 


pling time 7, , the active voltage vectors U, and U;4 should be activated during 
the time 7; and T;4;, respectively. Hence, the on-time 7, and T;4) are eva- 
luated by the following equations: 


UT s=UxT RtU HT KH (15) 


Splitting this vectorial equation into real and imaginary components, from (12)-(15) 
follows that: 


16 
SDs, te) 


3 


. ka * 
rs | sinbhees# 


Tk+1 Udc 


sin(@- - 


The on-time for the zero state vectors is obtained by the following equation: 


TO=Ts—(Tet+T Kaw (17) 
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When 7, +7TK41>T 5 the time 7, and 7,4; are rescaled as follows: 


te _ Ts | (18) 
Trai} TktT pa LTKH 


3.2 The Implementation of SVPWM in PSCAD 


Implementation of SVPWM is divided into three steps: [6,7,8] 

(1)Determine the sector number of the reference vector Units 
A = UB 

1 

B=—(V3 ua—UuB) 
Define variables A, B, C, P as: ; 
C= 5B ua UB) 
P=sign(A) + 2sign(B) + 4sign(C) 
1,x2=0 
0,x<0' 


Then, the corresponding relations of P and the sector number is shown in Table 1: 


Here is a sign function, defined as sign(x) = 


Table 1. Correspondence between P and sector NO. 


Sec. No. I I Il IV Vv VI 
P 3 1 5 4 6 


N 


(2)Determine the working period of the two adjacent vectors 
X= V3u pT s / ude 
Define }¥ = @uq+V3ug)T s/2ude > 
Z =(-3uqt V3up)T s ! ude 
When the desired voltage vector is in the six different sectors, the working period of 


the two adjacent vectors 7), 77 can be got from Table 2: 


Table 2. The value of T; .T 2 
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If Ty+T2>T, we should rescale 7, , To refer (18),ie. 
T1=TT5/(1+T2).T2=T2T s/ (11+ T2) 
(3). Determine the on time and off time of power device in the VSR 
Ta=(s-T,-TD/4 
Define; T,=Tqt+T 1/2 ; 
Tc=TptT2/2 


Then, the on time of power device V;,V3,V5 in every cycle can be got from 
Table 3, the Corresponding off time is T,— T em - 


Table 3. The value of T cy 


Sec. NO. I jit ll IV V VI 
T cml Ta Tb Tc Tc Tb Ta 
T cm2 Tb Ta Ta Tb Tc Tc 
T cm3 Tc Tc Tb Ta Ta Tb 


4 Simulation Study 


Using PSCAD simulation software, build a main circuit of three-phase VSR and a 
dual closed-loop control circuit based on SVPWM and SPWM. The main parameters 
of the system: The AC source with frequency of 50Hz, the grid-side line voltage is 
380V, the line inductance of each phase is 2 mH , the AC side resistance is 0.1 Q, the 
reference value of DC voltage is 700V, the DC capacitor is 3000 WF , the load resis- 


tor is 30Q and the switching frequency of VSR is 5 kHz . 

Use SVPWM and SPWM respectively, the DC bus voltage is shown in Fig.5, Fig.6 
(the units of voltage and current are KV and KA respectively), we can see that the DC 
bus voltage is stable at a given value of 700V. 

Voltage and current waveforms of phase A are shown in Fig.7,Fig. 8, we can see 
that the phase angles of phase voltage and phase current are the same, operating at 
unity power factor. Comparison of Fig. 7 and Fig.8 can be found that the current 
waveform is sine wave in Fig.7, while the waveform distortion Obviously in Fig.8, 
according to FFT analysis, as shown in Fig.9, Fig.10,we can see that when using the 
SVPWM, the harmonic current components content of phase A is zero, while using 
the SPWM, the harmonic current components content is higher, especially the 6k +1 
harmonics (k =1,2---,n). 
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Fig. 7. J, and U g using SVPWM 
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Fig. 9. FFT analysis using SVPWM 


5 Conclusion 
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Fig. 8. J, and Ug using SPWM 
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Fig. 10. FFT analysis using SPWM 


This paper proposed a feedforward decoupling dual close-loop control scheme for the 
three-phase VSR. Computer simulation results have verified that the controller can 
offer an excellent steady-state performance, fast dynamic response, as well as the 
convenient active and reactive power decoupled controls. The SVPWM rectifier has 
the advantage of less harmonic content ,easier digital realization, which will be wide- 


ly used in the industry. 
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Abstract. The reliability of distribution system of electricity is one of the most 
important indicators to measure the ability that the power supply system sends 
electric power to the load continually. Also it is an important technical and eco- 
nomic indicator for the power supply enterprise. In this paper, the theory about 
the reliability of distribution power system have been studied, a method of net- 
work equivalent is introduced. The validity of this method is examined by an 
example of distribution system of Pudong International Airport. Finally, the ar- 
ticle analyses a number of factors which may influence the indicators of 
reliability. 


Keywords: distribution system; reliability evaluation; network equivalent. 


1 Introduction 


Reliability one of the important indicators in power system is reflected in the 
electricity power industry to meet the needs of the national economy. With the devel- 
opment of social production and people's living standard demand for electricity is 
growing. Reliability now has become equally important power quality standards with 
the voltage, frequency and so on. 

The composition of power system is extremely complex. There are so many com- 
ponents in the system and the structure is precision, variable. Meanwhile, the power 
grid exposes to the natural environment, it is for these reasons and their own external 
interference, power accidents have occurred, seriously affects the load to acquire 
power continuously, making the country suffered great economic losses. Distribution 
system supplies directly to end users. Frequent failure of its equipment, caused 80% 
of our power blackout [1]. 


2 A Method to Evaluate Reliability of Distribution Network Based 
on Network Equivalent 


Distribution system reliability evaluation algorithm can be divided into two catego- 
ries: analytical method and simulation method. Analytical method use fault enumera- 
tion for the selection and estimation of state, establish a more rigorous mathematical 
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model to calculate the reliability index. Simulation carries out the state's options and 
estimation by sampling the probability distribution of the components to obtain relia- 
bility index using the method of reliability. 

Distribution network reliability assessment is to traverse all the basic principles of 
component failure on the reliability of all load points, calculate the load point reliabil- 
ity indexes, and finally integrated all the load point reliability indexes deriving system 
reliability. Concrete steps generally are: 


e Simplify complex network to the formation of a simple radial network 
e According to component reliability index, obtain the equivalent load point in- 
dex, and indicators of the whole system. 


2.1 Simplify the Complex Network 


In the process of distribution network reliability evaluation, the device can be divided 
into non-repairable and repairable components. For the specific component in a dis- 
tribution network, contains normal state and fault state. For the protection compo- 
nents, assuming they all work at 100% reliability. Load is the average load model. 

Distribution system usually uses simple radiation network to supply the consumers. 
In both ends power supply system, the normal form generally will open the switch to 
cut off two simple radiation networks. This network is characterized by all the com- 
ponents are in series, the branch circuit components and the main trunk line segments 
are connected in series too. 

A number of component feeders form this type of radiation network structure. First 
of all, divide the network into some layers according to the number of feeder, each 
feeder and various components connected to the feeder is a layer. Each layer of the 
network can be equivalent to an equivalent branch line, gradually upward from the 
end of the equivalent layer, and finally you can translate a number of complex net- 
works with branch feeders to a simple radial network [2][3]. 


2.2 Network Equivalent and Calculation of Indexes 


Assessment of Complex distribution network reliability, including two equivalent 
processes: upwards and downwards equivalent. In the process of upwards equivalent, 
the branch feeder line will be replaced by an equivalent node component at a higher 
level feeder. The impact of higher-level components which is brought by equivalent 
elements can be represented by the failure rate, failure time of power supplyU , 
fault repair time7. In the process of downwards equivalent, the impact of feeder in 
the low-level which is brought by the feeder in high-level will be replaced by an equiv- 
alent component at the beginning of low-level feeder [3][4]. 
1) The branch feeder with circuit breaker 


A=(1-P)YY A, 
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A, is the failure rate of node k in the branch feeder; P is the reliability of circuit 


breaker at the beginning of branch feeder; f, is the working time of disconnection 


switch which is couple with the circuit breaker in the branch feeder. 
2) The branch feeder without circuit breaker 


k=1 
n 
| San 
| Z = = 
pan 
k=1 
n 
C= SAF, 
k=1 


A, is the failure rate of node k in the branch feeder; 7, is the repair time of feeder 
which is caused by the failure of node k. 


For the system of simple radial feeder, reliable indicators of the load node can be 
calculated as follows: 


io A AAD 


k#i,k=1 


U.= VA + Athy + Ayr 


it” it 
k#itik k=l 
A; is the failure rate of node k in the branch feeder; 7, is the repair time of node i 


which is caused by node k. Fr, relies on the structure of system and the position rela- 


tionship between power source and component i. 
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With the above formulas, in accordance with the foregoing, we can easily get the 
index, constitute a new simple radial network, and all the load point reliability indexes 
of the whole system can be obtained [5][6]. 


3 Example Calculation 


The example in this paper bases on a distribution power system of Pudong Interna- 
tional Airport. 


3.1 Raw Data 


This power system contains three inlet wires, they are JiXiang4068, ZhenHang913, 
and JiXiang4053. Their voltage is 35kv, after passing three main transformers: 
#1 #2 #3, each of inlet wires divides into two lines, constitute six 10kv buses. Each 
of these buses has nine feeders to the load. Every subsystem of 35kv can be connected 
to a ring by circuit breakers, but they will not operate in the ring at normal time. The 
indexes of main devices are listed as fellows: 


Table 1. Component reliability data. 


Component reliability data 
Equipment-forced outages 
Component Classes Failure rate Repair time 
( time per year) ( hour) 
35KV 0.2865 9 
circuit breaker 1OKV 0.048 4 
0.4KV 0.03 3 
35KV 0.3333 2.5 
transformer 
10KV 0.1 15 
bus 0.1 3 


3.2 Assumptions 


e The generating system and transmission system are all reliable, and the genera- 
tion system always can meet the requirements of load. 
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The circuit breakers are installed at the feeder, the tie lines between buses and 
transformers contain circuit breakers. All of these circuit breakers can work 
reliable. 

The high-level substation of JiXiang4068 and JiXiang4053 is JiChang substa- 
tion. So the feeder of JiChang substation may have three programs: 


a) JiXiang4068 and JiXiang4053 come from the same bus. 

b) JiXiang4068 and JiXiang4053 come from the same feeder which divides 
into two lines leading to two buses. 

c) JiXiang4068 and JiXiang4053 come from two different feeders; each of 
these feeders leads to two buses. 


Obviously, the reliability of the three programs is different, program c) is the highest, 
but program a) is the lowest. However, the relationship between the three programs is 
the same when system operating at non-ring state. 


3.3. Networks and Calculation of Indexes 


This distribution system is a radial network, upwards equivalent to the 10kv side of 
transformer #1 which leads to two buses through circuit breakers. Now, the system 
should be discussed in two conditions. 


Assuming that the network operating at Open-loop, and manually switching the 
breakers. Each inlet wire and two distribution buses build a small system. On the 
basis of previous step that the load has been equivalent to the distribution buses, 
assuming that the bifurcation on the 10kv side of main transformer is a node. 
Than the load node should be equivalent to this bifurcation. 

When open-loop operation, if there are some fault happen in the small network, 
relay will operate, the circuit breaker on the key point may open. This condition 
should be considered. If the bus tie breaker is automatic switching mode, the bus 
tie line can connect with adjacent small 35kv system automatically. The fault 
bus can restore operation. This condition looks like a shunt system, now it 
should be divided into two programs: 


a) If the inlet wires of adjacent buses come from different high-level substa- 
tion, we can only consider these two buses. This program has higher power 
supply reliability. 

b) If the inlet wire of adjacent buses come from the same substation, and this 
inlet wire contains a bifurcation. It is obvious that this kind of program’s 
power supply reliability is lower. 


According to the simplification of network, we can calculate the reliability of this 
distribution system by some common indicators. 
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4 Calculation Results and Analysis 


After the calculation of example system, the result listed as fellow table: 


Table 1. Reliability index 


From the table above, we can know: 


° In the first condition, manually switching mode, its ASAT is 0.99948. And in the 
other two conditions, they all use automatic device, their ASAI are higher than 
the first condition. The switching time is usually 2 minutes (automatic switch- 
ing) or 30 minutes (manual switching). If we ignore the delay of automatic 
switching, the manual switching should be considered as a non-backup system. 

e In the case of automatically switching system, the source of distribution network 
may influence the reliability. If the mutual backup two inlet wires come from 
different source, its reliability mustn’t lower than the backup line come from dif- 
ferent source. From the table, we know that in the second condition, assuming 
that the ZhenHang913 is the backup line of JiXiang4068, its ASAT is 0.999714. 
In the third condition, though the final backup is still ZhenHang913, as the non- 
adjacent buses, JiXiang4068’s backup bus is JiXiang4053, they come from the 
same source. So if ZhenHang operates as backup, it will pass more breakers and 
buses. The ASAI of whole system is lower. 


Now we analyze reliability of system from same source. JiXiang4068 and JixX- 
iang4053 lead from the high-level substation, they may have three programs. Usually, 
the less devices the more reliability. But in the case of different feeders of the same 
source, under this adverse factor, if we can divide these two feeders’ relationship, the 
reliability will be higher. C) minimizes the relationship between feeders. 


5 Conclusion 


This paper introduce a method to evaluate reliability of distribution network based on 
network equivalent, its key point is the equivalent of complex network. From the 
example system, we can see that different backup method and different bus arrange- 
ment will exert significant influence on the reliability index. This conclusion suggests 
us to improve system reliability by the following visual angles: 
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e Power system should use automatic device, they are good for fast action, streng- 
then system’s backup ability, and improving the reliability of overall power grid. 

e — If it is permitted, use some security network construction such as dual power 
supply, sectionalized buses. This will help to reduce the probability of power 
outages, reduce the extent of failure and enhance ability of sustainable supply. 


Consider the security, reliability and economy of whole grid, arrangement the system 
operation reasonable, give full play to the performance, guarantee power quality ef- 
fectively, and avoid devices overload and other accidents caused by unreasonable 
arrangement of grid operation [8][9]. 
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Abstract. In this paper, we focus on the effective coverage benefits with the 
deployment of mobile relay stations in the next generation wireless cellular 
networks based on call dropping probability discussion. The special scenario is 
defined that mobile relay stations are applied on carriages of a fast-moving train 
to improve the quality of communication for user equipments on the train along 
railway. According to the analysis of effective coverage area performance based 
on the variation in received signal power over distance in path loss and shadow- 
ing with corresponding performance evaluations, it is shown that under the 
same conditions the effective coverage area with relay stations in the wireless 
cellular network outperforms that of without relay stations, and the higher the 
call dropping probability based on user equipment is, the higher BS antenna 
height is, and the wider area the BS covers. 


Keywords: mobile relay station, path loss, shadowing, effective coverage area, 
call dropping probability. 


1 Introduction 


3GPP proposed LTE-Advanced as the further evolution of LTE. One of the evolution 
goals is to fulfill and even surpass all of the IMT-Advanced requirements on capacity, 
data rates, latency, spectrum efficiency and low-cost deployment, etc. In order to 
reach the requirements for the evolution of LTE, 3GPP proposed some technology 
components in LTE-Advanced system. One of the potential key technology compo- 
nents is relay technology. By inserting relay stations (RSs), relay technology can 
effectively increase data rates, extend coverage area, expand system capacity, im- 
prove spectrum efficiency, combat fading, and enhance system robustness, etc. In 
addition, it also deploys the wireless cellular network in a cost efficient manner and 
thus cuts down OPEX and CAPEX. Relay technology has been studied and consi- 
dered in the standardization process of 3GPP LTE-Advanced. Relaying is considered 
for LTE-Advanced as a tool to improve e.g. the coverage of high data rates, group 
mobility, temporary network deployment, cell-edge throughput and to provide cover- 
age in new areas [1]. 
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The so-called relay technology, taking the simple two-hop relay for example, is that 
a direct bad link from UE(User Equipment) to BS(Base Station) or from BS to UE 
can be broken down into two shorter better links, a UE-RS link and a RS-BS link, or a 
BS-RS link and a RS-UE link. In this case, a UE is able to communicate directly with 
the BS or indirectly uses some relay stations to relay its signals to the BS. 

In [2] two different types of relaying network architecture have been investigated. 
One is proposed to use others users’ terminal (Mobile Relay Station, MRS) to relay 
traffic while the other is proposed to use Fixed Relay Station (FRS) to relay traffic. 
The two different concepts are shown in Fig.1.Lots of studies have been done on 
relaying system with either FRS or MRS [3-12]. Compared with FRS, MRS can pro- 
vide more flexibility to wireless cellular network [13]. 


FRS 


(a) FRS (b) MRS 


Fig. 1. Cellular networks with relaying. 


In this paper, we focus on the effective coverage benefits based on call dropping 
probability discussion with the deployment of MRSs in the next generation wireless 
cellular networks. There are mainly two different types of scenarios to deploy MRSs 
in wireless cellular networks. One is to make MRSs deployed on moving vehicles, 
such as trains, buses, cars, etc, to cover areas in/on/outside the vehicles. The other is 
to make the non-active UE (i.e. in idle state) to relay the signals between the active 
UE to BS, acting as a MRS. We choose the first type of scenario that MRSs are dep- 
loyed on a fast-moving train as the scenario to be discussed. In the following discus- 
sion we assume that MRSs all work in AF (Amplify-and-Forward) relaying way. 
Considering the variation in received signal power due to path loss and shadowing, 
we analyze the call dropping probability based on UE (CDPUE) performance and 
CDPUE-based effective coverage area performance. Comparing with the convention- 
al wireless cellular network without RS, the results show that under certain 
conditions both the CDPUE performance and CDPUE-based effective coverage per- 
formance are improved. Meanwhile, the performance gain under different antenna 
height is also compared. 

In this paper, the system model is specified in section 2. Then the CDPUE-based 
effective coverage area analysis is discussed in section 3. And the performance 
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evaluations and discussions are provided in section 4. Finally, a conclusion is drawn 
in section 5. 


2 System Model 


The following analysis is based on the specified scene of adopting RSs deployed on a 
fast-moving train. The scenario of relay-assisted is shown in Fig.2. By the RSs, UEs 
on the train can communicate with BS. Characteristics for the wireless cellular net- 
work along the railway are specified now. 


* Low traffic of communication. Thereby, the coverage is the main issue that cut 
down OPEX and CAPEX. 

* Deployment of Macrocell to cover wide area in a cell. 

* High-speed mobility of UEs. Due to the mobility of UEs relative to the serving 
BS, the received signal may have a Doppler shift. 


i 
-_s 


BS UE 


Fig. 2. Relay-assisted scenario in downlink. 


According to the summarized characteristics above, MRSs are adopted to improve 
the effective coverage and communication quality on the train. Two types of RSs 
have been defined in 3GPP LTE-Advanced, Type- I and Type- II. Depending on the 
relaying strategy, a relay may be part of the donor cell and control cells of its own [1]. 
In this paper, we mainly consider a RS to be part of the wireless cellular network. In 
this case, smart repeaters, decode-and-forward relays and different types of L2 relays 
are examples of this type of relaying. The relay station is wirelessly connected to 
radio-access network via a donor cell. The connection can be inband, in which case 
the BS-to-RS link shares the same band with direct BS-to-UE links, or outband, in 
which case the BS-to-RS link does not operate in the same band as direct BS-to-UE 
links [1]. In our analysis, we consider the inband RSs. 

RSs are considered as smart repeaters, and MRSs relay signals using AF (Amplify- 
and-Forward) relaying. For the inband RSs, since a wireless device cannot simulta- 
neously transmit and receive signals at the same frequency channel, we consider the 
time division case that RSs receive and transmit in different time.. 

Consider the downlink relay channel from a BS to a UE via a RS, as shown in 
Fig.2. It is assumed that Rayleigh fading is between BS and RS and Rice fading is 


150 W. Zheng, R. Zhao, and D. Su 


between RS and UE. In the first time slot, the signal transmitted by a BS is received 
by a RS, which is: 


yp =P, hx, + (1) 


In equation (1), x; represents the signal with the transmission power P,, and h, 
represents the channel coefficient from BS to RS in Rayleigh distribution, and n,; 
represents the corresponding Gaussian noise with zero mean and variance No. 

Then, during the second time slot the RS amplifies y; and retransmits it to the des- 
tination UE. The UE receives: 


yo =AJP. hy, +g (2) 


In equation (2), A represents the amplification factor to scale the transmission power 
by the RS, P, represents the power transmitted through the RS, hz represents the 
channel coefficient from RS to UE in Rice distribution, and nz represents the corres- 
ponding Gaussian noise with zero mean and variance Np. It is assumed that the RS 
and the UE receiver chains have identical noise properties. 


3 Performance Analysis 


The following analysis is to discuss the received signal power variation for the reason 
of path loss and shadowing over distance. 

The COST 231 extension of HATA model is adopted as the pass loss model (in 
dB), which is [14]: 


L(d)dB = 463+33.91g f, -13.82lgh, —ath,) +(44.9-6.55|gh,)lgd + Cy (3) 


a(h,) =3.2(g11.75h,)* —4.97 (4) 


In equation (3), d represents the distance from transmitter to receiver, f. represents the 
carrier frequency, h/h, represents the Tx/Rx antenna height, and Cy is a constant 
factor for suburbs, OdB. 

The log-normal shadowing model is adopted due to shadow fading, which is the 
most widely used statistical model. In the model the path loss y is assumed random 
changed with a log-normal distribution referred to [14]. 


(Oly = My 4p)” 


PY) = a exp 5 
220y VW 20, 


y>0 (5) 


In equation (5),¢=10/In10, “,,, refers to the mean of y,, = 10lgy in dB and oy, 


refers to the standard deviation of y,, in dB. 


Then we can obtain the mean of wy (the linear average path loss) from (5) due to 
[14]. 


2.8, 
= E = Vas Vas (6) 
Lu, = Ely] oo] Hae S| 
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The conversion from the linear mean to the log mean (in dB) is derived from (6) 
due to [14]. 


Ovi 
leg = hy.t— (7) 


Based on (5)-(7) and with changes of variables we get the Gaussian distribution of y 


with mean Hy oy and standard deviation Ov [14]: 


(Wap —Ly..)” 
1 exp Vas Hy wp 


210 y 1, 20), 


(8) 


Models for path loss and shadowing are typically superimposed to capture power 
falloff versus distance along with the random attenuation about the path loss from 
shadowing. According to the combined model the received power in dB is given by: 


P, /P, (dB) =-PL—Wag (9) 


In equation (9), P, represents the received power, P, represents the transmission pow- 
er, PL represents the pass loss in (3), and y,, represents a Gauss-distributed random 


variable with mean zero and variance Gye : 


Based on models described above, we discuss call dropping probability based on 
UE (CDPUE) and CDPUE-based effective coverage area. CDPUE is the call drop- 
ping probability in the downlink that the received power of UE falls below a target 
minimum received power level. In wireless systems there is typically a target mini- 
mum received power level P,,,;, below which performance becomes unacceptable. The 
call dropping probability is defined based on UE CDPUE(P yin, d) under path loss and 
shadowing to be the probability that the received power of UE at a given distance d, 
P,(d), falls below Pinint: CDPUE(P nin, d) = p(P(d) < Pin). For the combined path loss 
and shadowing model it can be expressed by the following equation: 


CDPUE(P,,,,4) = 1— Q((Prin — (BP - PL))/o,,,, ) (10) 

In equation (10), the Q function represents the probability that a Gaussian random 
variable X with mean zero and variance one is bigger than z [14]: 

“1 

Q(@)= x >= [| — 

P z 


20 


exp(— y? /2)dy (11) 
CDPUE-based effective coverage area refers to the coverage area of the wireless 


cellular network along the railway when the wireless cellular network along the rail- 
way is under the condition of different CDPUE requirement. 


4 Results and Discussion 


Based on the scenario and analysis described above, the results of performance evalu- 
ations are present in this section. It is assumed that there is one MRS on every 
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carriage to cover the carriage, and each UE in a carriage selects the MRS which cov- 
ers the carriage as its relay. To evaluate the performance improvement, the assump- 
tions are same both to the conventional cellular network without relaying and our 
analytical model with relaying. For performance evaluations, the parameters relevant 
are listed in Table1. 


Table 1. Parameters. 


Parameter Value 
Carrier Frequency 2GHz 
UE Antenna Height 1.5m 
BS Antenna Height 30m/100m 
RS Antenna Height Sm 
Radius of Cell 2000m 
UE Transmit Power 23dBm 
BS Transmit Power 43dBm 
RS Transmit Power 37dBm 
Pmin of UE -97dBm 
Shadowing standand deviation 8dB 


CDPUE 
& 
h 
» 


1 111213141516171619 2 
BS and UE(km) 


Fig. 3. CDPUE versus the distance between UE and BS comparison. 


When a train is run, a UE receives signals from a serving BS in cell with or without 
relays. Fig. 3 shows the CDPUE on the train in a single cell changing with the dis- 
tance between UE and BS for the direct path without relays and the relayed path with 
relays respectively. Furthermore, to evaluate the effect about different antenna 
heights, the CDPUE changing with different conditions of BS antenna height (when 
BS antenna height is 30 meters and 100 meters) for both the direct path and the re- 
layed path is also shown in Fig.3. It is observed that the CDPUE performance in the 
relay-assisted wireless cellular network is better than the conventional wireless cellu- 
lar network without relays. And, the CDPUE performance under the condition of the 
BS antenna height 100 meters is better than the condition of 30 meters for both in 
direct path and relayed path. 

Fig. 4 shows the effective coverage area corresponded to CDPUE changing for the 
direct path without relays and the relayed path with relays respectively. Meanwhile, 
the curves of the effective coverage area versus the CDPUE under the condition of 


Effective Coverage Area Based on Call Dropping Probability 153 


different BS antenna heights, both 30 meters and 100 meters, are presented to eva- 
luate the effect of different BS antenna heights on the effective coverage area, for 
both direct path and relayed path. We can obtain from the curves that under the same 
conditions the effective coverage area with RSs in the wireless cellular network out- 
performs that of without RSs. Also, it is observed from the curves that different BS 
antenna heights has different impact on the effective coverage area, and the wireless 
cellular network under the condition of 100 meters BS antenna height has better effec- 
tive coverage area than that of 30 meters BS antenna height. Then we can deduce that 
under the same condition the higher CDPUE is, the higher BS antenna height is, and 
the wider area the BS covers. 


0 0.050. 10. 150.20.250.30.360.40.450.50.550 60.650. 
CDPUE 


'5080,850.90.95 1 


Fig. 4. The effective coverage area versus the CDPUE comparison. 


5 Conclusions 


In this paper, we focus on the special scenario that MRSs are deployed on carriages of 
a fast-moving train, and discuss the performances improvement of the CDPUE and 
the CDPUE-based effective coverage area brought in relay-assisted scenario. By per- 
formance evaluation and comparison, it is concluded that the CDPUE performance in 
the relay-assisted wireless cellular network is better than the conventional wireless 
cellular network without relays. And under the same conditions the effective coverage 
area with RSs in the wireless cellular network outperforms that of without RSs. Also, 
the numerical result shows that the effective coverage area difference comes from 
different BS antenna heights. According to the discussion above, it is deduced that 
under the same condition the higher CDPUE is, the higher BS antenna height is, and 
the wider area the BS covers. 
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Abstract. This paper investigates the joint impact of variations of the frame rate 
and quantization on the coding bit-rate. Through theoretically linking the frame 
rate and the quantization step to the mean absolute difference (MAD), the MAD 
is modeled by a 2-D function, based on which a 2-D rate model is proposed. 
Extensive experimental results have demonstrated the effectiveness of the pro- 
posed 2-D rate model. Together with a 2-D distortion model, the proposed rate 
model can serve as the basis for advanced rate control in video coding and rate- 
constrained scalable video coding as well as bitstream adaptation. 


Keywords: 2-D rate model, rate control, mean absolute difference (MAD), 
frame rate, quantization step. 


1 Introduction 


Rate model plays an important role in video coding and transmission. Using an accu- 
rate rate model, optimized encoding parameters can be obtained ahead of coding. The 
relationship between the rate and the distortion for texture coding has been given a 
considerable amount of attention for video coding, such as in early schemes TM5 
where a simple first-order linear rate-quantization (R-Q) model is employed for rate 
control [1]. The quadratic R-Q models become popular in later schemes, providing 
better performance than the linear model at the price of higher computational com- 
plexity in TMN8 and VM8 [2][3]. Recently, He and Mitra have proposed a 
p -domain model that estimates the rate, where p indicates the percentage of zero 


coefficients after quantization [4]. Many other R-Q models have also been developed 
along with the development of rate control schemes for video coding [5][6], with the 
same aim of adjusting the quantization parameter (QP) given a targeted bit-rate. 

In low bit-rate applications such as wireless video, however, heavy quantization 
and temporal resolution reduction (i.e. skipping frames) are usually introduced in 
video coding to adapt to the limited bandwidth. As a result, quality degradation is 
inevitable in both the spatial and the temporal domain. In particular, frame dropping 
causes jitter/jerkiness for human perception. And in existing schemes, frame skipping 
is always employed when the buffer tends to overflow. This is a rather passive 
process and the result may impair the overall rate-distortion (RD) performance and 
causing incoherency in motion considering perceived video quality. Therefore the 
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encoder should determine the frame rate together with the QP for a target bit rate, to 
optimize the rate-distortion performance. Considering the overall RD performance, 
the encoder should determine whether to encode the video at a higher frame rate but 
with significant quantization, or to encode at a lower frame with less quantization. 

On the other hand, in scalable video coding (SVC), temporal, spatial and quality 
scalabilities are implemented to make the video stream scalable. It allows transmis- 
sion and decoding of partial streams according to varying network conditions. In 
SVC, a bitstream extractor is needed to extract sub-bitstream. A very simple and inef- 
ficient method is to randomly discard bitstream units until the desired bit rate is 
achieved. For better performance, rate-distortion optimized extraction method can be 
used. No matter what kind of methods are to be employed, an analytical rate model is 
useful for a computational solution if without resorting to exhaustive search. 

To solve the above problems, a more advanced two-dimensional (2-D) rate model 

is needed, where the relationship between the bit-rate, the QP and the frame rate is 
formulated. The integration of temporal resolution with quantization on the perceptual 
quality has been addressed in [7][8]. However, their joint influence on the rate is still 
not clear. To the best of the authors’ knowledge, the only report on the 2-D rate model 
is with regard to scalable video coding [9], where the model is empirically developed 
according to experimental observations. 
In this paper, we address how to build a 2-D rate model. In the proposed work, exten- 
sive experiments and theoretical analysis are performed in modeling the impact of the 
quantization step and the frame rate on the bit-rate. Through linking the frame rate 
and the quantization step to the mean absolute difference (MAD), the coding bit-rate 
is modeled as a 2-D function. Using the proposed 2-D rate model, the encoding frame 
rate as well as QP can be determined given a targeted bit-rate for video coding. Such a 
model is also useful for bitstream extraction in scalable video coding. 

The reminder of this paper is organized as follows. The proposed 2-D rate model is 
described in section 2. To validate the performance of the proposed model, experi- 
mental results are reported in section 3. Section 4 closes this paper with concluding 
remarks. 


2 Two-Dimensional Rate Model 


After encoding a video sequence, the average bit-rate (R ) of the sequence can be 
expressed as 


R= fXLR =fXR, (1) 


where K is the number of coded frames, f is the original frame rate, and R is the 
average number of bits per frames. Intuitively, R decreases linearly as f decreases, 
however, changing f will impact statistics of the residual because of predictive cod- 
ing, and then R’ will be influenced, which makes this problem complicated. In [10], 


it has been confirmed that R increase as f decreases according to experimental 
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results. In the following we will quantitatively investigate the impact of changing f 


on R , based on the quadratic R-Q model. Then the impact of the frame rate on the 
bit-rate can be evaluated. 


2.1 Analysis of Quadratic Rate Model 


One of the most widely adopted R-Q models in video coding presents itself in a qua- 
dratic polynomial [3]. An improved version of the quadratic R-Q model distinguishes 
header bits and texture bits, with the MAD introduced as the complexity measure 
[11]: 

R ocne = MADX(a,Xq°' +b,Xq~), (2) 


texture 


where R 


texture 


denotes the amount of bits to encode the texture information, g is the 
quantization step, a, and b, are parameters, and MAD is computed using the mo- 
tion-compensated residual for the luminance component. 

The MAD varies with the following two factors in video coding. (i) Frame rate. 
Encoding at a lower frame rate is equivalent to down-sampling the original sequence 
in the temporal domain. Through predictive coding, the MAD after motion estimation 
would be different for the same video scene at different frame rates. (ii) Quantization 
step. In the encoding process, reconstructed frames will be used as reference. There- 
fore, different quantization steps will lead to different distortions, resulting in differ- 
ent MAD values. Armed with the above arguments, the relationship between the 
MAD and the frame rate as well as the quantization step is built, based on which a 
new two-dimensional rate model is proposed. 


2.2 Modeling MAD and Frame Rate of Original Video 


Let f(x, y) represent the pixel value at (x, y) in the nth frame in the original video 
sequence, M and N be the width and height of a frame respectively, and MAD, ,, be 


the MAD value between the mth frame and the nth frame. Denoting the horizontal 
and the vertical components of the motion vector at (x, y) in the mth frame which 


references the nth frame as mvx, ,,(x, y) and mvy 


MAD, 31 = : bes 


MXN GS 


(x, y), respectively. Then 


nm 


Tie (x+ MVX yp ni Xs y), y + IVY py nai (X y)) = di (x, y) (3) 


1 
MAD, ny = ac oe 


x oy 


fies (x+ TVX st n42 (x, y), y ae IVY nst.n42 (x, y)) = Fost (x, y) ‘ (4) 


Suppose that the motion vectors of successive frames are coherent, which holds for 
most frames with similar motion and background in the same video scene. Consider- 
ing the MAD between the nth and the (n+2)th frame 
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According to inequality rules, and expending (5), we can obtain 


MAD, ,,, = @(MAD, ,,, + MAD,,,, ,,,)+ B(MAD, ,, ,,, - MAD 


nyn+1 ntl,nt+2 n+1,nt+2 aca) 


+ (MAD MAD 


nyntl ~~ eae.) 


, (6) 


where a@ , Band ye (0,1), and a+ f+y=1. Here a, f andy are probability val- 


ues, corresponding to each component in (6). 
In the same video scene with motion coherence, it can be assumed that 


MAD 


nntl 


= MAD 


ntl,nt+2 = MAD,,, 7 (7) 
So (6) can be expressed as 
MAD, 44 = 2a@MAD,,, Z (8) 


Based on the above analysis and extend (8) for any m and n, we have 


MAD,,, m-n=1 
MA nm ? 
(m—n-l1)AMAD,,, m-n=2 


org 


(9) 


where Ae (0,1). Accordingly, the average MAD of all frames in a sequence can be 
formulated as a function of the frame rate as follows: 


1 
MAD(t) = a,x +b,, (10) 
t 


where a, and b, are model parameters. 7,,, and ¢ in (10) denote the maximum 


frame rate and the frame rate, respectively. 

To verify the (10), experiments have been carried out using the H.264 reference 
software JM8.6 [12]. Four video sequences in CIF (352 Xx 288), i.e., “Mother- 
daughter’, “Container”, “Mobile”, “News”, were encoded under the frame rates of 
Sfps, 7.5fps, 15fps, 30fps, respectively. Here 30fps is the maximum frame rate. The 
reference frame is not the reconstructed signal but the corresponding original signal. 
In all simulations given in this paper, the first IDR frame was intra-coded only, and 
the remaining frames in a video sequence were inter-coded (P-frame) with all availa- 
ble modes. The number of reference frame was 1. The experimental results are plotted 
in Figure 1, where a linear relationship between the MAD of the original sequence 
and the reciprocal of frame rate can be generally observed. 
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Fig. 1. The relationship between frame rate and MAD of original video 


2.3 Modeling MAD and Quantization Step 


To see how quantization step influences the MAD, we encoded several test video 
sequences using the H.264 reference software JM8.6 [12] and measured the actual 
MAD corresponding to different quantization steps. Specifically, four video se- 
quences , “Mother-daughter’’, “Container”, “Mobile”, “News”, all in CIF (352 x 288), 
were encoded with the quantization parameter (QP) from 15 to 36. In the results re- 
ported in this paper, the frame rate are 5fps, 7.5fps, 15fps, 30fps. Using the H.264 


mapping between g and QP, ie.,g =2°°*’°, the corresponding quantization step 


can be computed. The experimental results are plotted in Figure 2. The observations 
suggest a linear relationship between the MAD and the quantization step at a certain 
frame rate. 

Therefore, at a certain frame rate, the average MAD varies with the quantization 
step as 


MAD(q) =a, Xq+b,, (11) 


where a, and b, are parameters, and q is the quantization step. 


2.4 The Proposed 2-D Rate Model 


According to the above theoretical analysis and experimental observations, combining 
formula (10) and (11), we propose the following model: 


T 
MAD(q,t)=axqtbx— +c, (12) 
t 


where a,b and c are model parameters, gq and ft donate the quantization step and 
the frame rate, respectively, and T.,, is the maximum frame rate. 
Then substituting (12) into (2), the coding bit-rate can be modeled as 
R(q,t) = MAD(q,t)x(X,Xq''+X,Xq°) 


T, 4 aed (13) 
=(axqtbx—S+c)x(X,xq +X,Xq°) 
t 
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Fig. 2. The relationship between MAD and quantization step(a)“Mother-daughter” 
(b)’’Container” (c) Mobile” (d) “News 


where X, and X, are model parameters. This rate model is two-dimensional which 
varies with the frame rate as well as the quantization step. 


3 Experimental Results 


The performance of the proposed 2-D rate model has been evaluated by using the 
H.264 video codec of reference software JM8.6 [12]. Standardized test sequences in 
CIF (352 x 288) were used. For all considered sequences, the coding video frame 
rates were 30f/s, 15f/s, 7.5f/s, 5f/s, respectively, and 30f/s was the maximum frame 
rate. Each sequence was coded with the QP of 15, 20, 25, and 30, respectively. Using 
the H.264 mapping between g and QP, the corresponding quantization steps were 
3.56, 6.35, 11.31, and 20.16. For the sake of conciseness the results reported in this 
paper include only four test sequences: “Mother-daughter’, “Container”, “Mobile”, 
“News”. The model parameters a, b and c, were obtained by linear regression using 
the measured and predicted MAD corresponding to all g and ¢ in (12). Parameters 


X,and X, were obtained by linear regression using (13). The actual rate data of four 
test sequences with different combinations of g and ¢ , and the corresponding 
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estimated rates via the proposed model (13) are illustrated in Figure 3. RM denotes 
the value of predicted rates. From Figure 3 we observe that the model predictions fit 
well with the experimental rate points, with an average Pearson correlation coefficient 
(PCC) of 0.9917 and average relative error (RE) of only 8.38% over four video se- 
quence. Table 1 listed the RE and the PCC between measured and predicted rates. 


R (bit/pix) 


Fig. 3. Measured rate points and predicted rates using the 2-D rate model (13).(a)’Mother- 
daughter” (b)’Container” (c)’’Mobile” (d)”News”. 


Table 1. PCC and RE between mesured and predicted rates 


News Container Mother-daughter Mobile 
PCC 0.9957 0.9855 0.9993 0.9866 
RE 0.0923 0.1167 0.0301 0.0961 


4 Conclusions 


In this paper the impact of the frame rate and the quantization step on the MAD is 
examined and modeled through theoretical analysis and experimental validation. The 
corresponding model is brought into the quadratic R-Q model to obtain a new 2-D 
rate model, in terms of the quantization step and the frame rate. The accuracy of the 
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proposed 2-D rate model has been verified. Together with a 2-D distortion model, the 
proposed rate model is not only useful for non-scalable video encoder to determine 
the encoding frame rate and QP for a target bit-rate, but also useful in scalable video 
encoding and bitstream extraction. 
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Abstract. This paper explored the transmit antenna scanning modulation effect 
on the signal to noise ratio (SNR) loss in a noncooperative illuminator based 
bistatic radar. The general expression of normalized filed pattern for widely 
used antenna was analyzed. The direct-path signal and target echo modulated 
by the transmit antenna, which is mechanically scanned in azimuth, were de- 
scribed. Mathematical representations for direct-path signal and target echo 
were proposed with respect to the analytical expression of antenna’s pattern. 
The SNR loss of the coherent integration via cross-correlation processing was 
derived. The relationship between SNR loss and SNRs of direct-path channel 
and target echo channel was evaluated. The simulation result shows that the 
SNR loss caused by antenna modulation can be ignored if the average SNR in 
direct path is greater than 15 dB. However, the SNR loss deteriorates rapidly if 
the visual angle becomes larger. Some suggestions on choosing the IF band- 
width and data selection were presented. 


Keywords: Passive Bistatic Radar; Coherent Integration Loss; Antenna Pattern; 
Scanning Modulation Effect. 


1 Introduction 


In recent years, target detection and tracking systems based on illuminators of opportu- 
nity have received significant attention and resources, which is so-called passive 
coherent location system[1-3]. The well known advantages of such systems over con- 
ventional monostatic radar include low cost, immunity to ECM threats, etc. Most of the 
systems exploit radio and television broadcasting signals as noncooperative emitters. 
Due to the transmitting antennas of these kind of system is omnidirectional, there is no 
need to consider the effect of modulation caused by the scanning of the transmit 
antenna [4-6]. 

However, in the passive bistatic radar system which using radar emitters as sources 
of opportunity, it is desired to take the effect of modulation into account introduced by 
mechanical scanning antenna. Even if the system is synchronized in space, the inter- 
cepted the direct path wave and the target echo are modulated by different lobes of the 
transmitting antenna. Then, the gain achieved by coherent integration will be deteri- 
orated. Therefore, the SNR’s improvement factor will be limited by the transmitting 
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antenna scanning modulation[7]. [8]discussed the problems for cooperative bistatic 
radar from the relationship of the probability density of the target echo and its recep- 
tion angle. The loss caused by radar antenna pattern in different geometric configura- 
tion for the cooperative bistatic radar has been illustrated. However, there is little 
literature focused on the antenna pattern loss in the passive bistatic radar. 

This paper concentrates on the property of passive bistatic radar based on noncoo- 
perative conventional monostatic radar emitters. The model of transmitting antenna 
scanning modulation will be proposed in section 2. The coherent integration loss 
caused by antenna scanning modulation is dealing with in section 3. The analytical 
expression for SNR loss is derived, and the corresponding simulation analysis is pre- 
sented as well, and conclusions are made in section 4. 


2 Problem Description 


A scenario with non-cooperative radar illuminator considered in this paper is illustrated 
in Fig. 1. Assume that the transmitter is mechanically scanned in azimuth, and the 
receiver is stationary while the target is moving. Both the transmitter and receiver are 
focusing on the moving target. The receiving system intercepts the direct-path wave- 
form transmission through a reference antenna when it tunes to the transmitting fre- 
quency, and target reflection echoes are intercepted by the target antenna. Time and 
phase synchronization need to be completed via direct-path signal. Then, surveillance 
and early-warning may be achieved in the area of interest. 


ainob 
say me 
a ain ete 
Signy ras™ 
of wal 


Opportunity 
illuminator 


Passive bistatic receiving system 


Fig. 1. Schematic of passive bistatic pulsed radar system 


When space synchronization between the transmitter and receiver is completed, the 
antennas’ main lobes will cover the target. Then the target echo can be intercepted 
continuously. While at this time, the direct path reference signal is intercepted via the 
transmit antenna’s sidelobe radiation. It is found that the propagation factor between 
the direct path channel and the target channel is different during the coherent dwell 
time. The transmitting visual angle is changing with the movement of target. The geo- 
metry of the passive bistatic system is time variant as well. Therefore, it is desired to 
consider the modulation effect on the output of coherent integration. 
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3 Normalized Field Lobe Pattern for General Antenna 


For most radar antennas, such as parabolic, horn, or array antenna, if the relationship 
between the antenna aperture size d and the wavelength 4 satisfiesd/A >4~5, the 


normalized field lobe pattern can be approximately expressed as [9] 


F(6)= sin(1d6/A) 


1 
md0/A ” 


where @ is the angle deviation relative to the antenna main lobe, then the relative gain 
coefficient of different angles is 


G(0) _| sin(nd0/A)] 
G(0) ndO/A | 


(2) 


where G(0) is the maximum gain corresponding to antenna’s main lobe, the maxi- 
mum gain of sidelobes appear at angles when sin(zd@/A) =1, thus the relative gain 
coefficients at the maximum sidelobes are 

G6). 1 

G(0)  (nd6/A)” 


(3) 


If 1d0/A =nn(n=0,1,2,---), then sin(md@/2) =0, which are corresponding to the 


nulls of the antenna’s pattern. 


4 Coherent Integration Loss due to Antenna’s Modulation 


4.1 Model for the Received Signals 


Let the envelope of the transmit signal be 5,(t), then the direct path signal’s 
envelope can be expressed as 5, (t) =kpFp(t)5;(t-Tp)+fp (t), and the echo signal’s 
envelope will be Sp (t)=kgFe(t)5;(t-Tp)+7ig(t), where f)(t) and fp (t) are the 
noisy terms in the direct path channel and the target channel, with zero mean and of 
N, /2 variance, kp, and kp are attenuation factors for the direct path channel and the 


target channel, respectively. F,(t) and F,(t) are propagation factors, which are 


time-varying parameters with the scanning, 7, and Tp are time delays of the direct path 


signal and the target echo path. 
Suppose that the pattern propagation factor in the antenna’s main beam can be ap- 
proximately expressed as 


Fy (t) = exp(-1.3977 /T”) . (4) 
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Similarly, according to Eq.(3), the pattern propagation factor in the visual angle 0, 
can be approximated as 


F, (t) =A/nd@, exp(-1.397 /T") . (5) 


where -—T/2 <t<T/2, herein t corresponding to the antenna’s sweeping time in the 3 


dB beam width. 

Without loss of generality, let the rotating speed of the antenna and the pulse repeti- 
tion frequency (PRF) of the emitter’s signal be constant. And suppose that the receiv- 
ing system can intercept each pulse accurately during the coherent dwell time. Then 
coherent integration can be adopted using the target echo pulse train obtained during 
the space synchronous condition. The integration gain depends upon the target cohe- 
rent dwell time. In general, the coherent integration time or dwell time target is equal 
to time of the antenna’s 3 dB beam width illuminated on the target. 


4.2 The Ideal Output SNR of Cross-Correlation Processing 


According to [4-6], passive coherent processing is widely used to detect and estimate 
the presence of a target. Typically, the received signals are cross correlated over a 
two-dimensional ambiguity surface to compensate for time delay and Doppler differ- 
ence of the target echo. If there is no scanning modulation effect in the direct path 
channel, and the direct path signal is no noise-free, then the expectation of the cross 


correlation output during the interval [-7/2,7/2] is 
T/2 s 7 T/2 a 
E;{ 5(T) | =| f kpkp Fa (t) S;(t-Tp) Sp (t-Tp -ya| +e i) kpitp (t) Sp (t-Tp -na} — (6) 
-TI2 -T/2 
where T is the total dwell time. When T =T, —7T,, the output will be 


Es[5(T) |= e| i keky Fe (t)|Sr (t~t,)[ a e| if kpfiz (t) S> (ral .  @) 


-T/2 


The output SNR of Cross-correlation process is defined as 


es [ (7) Ens 5(7)]) 
vars[5(T)} 


where E, [ 5(T) denotes the expectation of the echo in the target channel, which 


SNR = (8) 


contains an actual target, and var, [ 5(T) | is its variance, Eys [ 5(T)| denotes the 


expectation of the echo in the target channel which contains noise only. If the noise in 
the receiving system is zero mean, then E,,, [ 5(T) | = 0, and we have 


E,3(T)l= e| { kek pFy (t)|S, (1)f a} , (9) 


-T/2 
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2 
Vary [5(T)]= 43 | f 5" (nat (10) 
2 -T/2 
Thus, the peak output SNR in the ambiguity surface will be 
T/2 ~ ae 
If Keka Fe (1) Sr (2) a 
(SNR)... = rae 5 (11) 
S" (t)dt 
—T/2 7 


Without loss of generality, let signal amplitude be a constant for simplify the expres- 
sion, 1.e. A, then 


_ 2kg A’ | p72 
(SNR),,, = NE | 28P(-1,397 /77) (12) 
Thus 
kA? | VaT ' 
SNR). =— f (0.59 : 13 
( ss NT a er ( ) ( ) 
2 ‘x 2. 
where erf (x) =—~] e“ du. 
(=f, 


4.3 SNR Loss Caused by Antenna Scanning Modulation 


The actual cross-correlation processing is performed with the received signals that 
modulated by the antenna’s pattern during the coherent dwell time. Therefore, the 
cross-correlation output should be 


T) =| [ke Fe (t) 5; (t-te) + ig (t) [ko Fo (1) 55 (1-2-2) +, (t-2) Jar. (14) 


T/2 


when T =T, —T,, the peak of correlation output is 


L T/2 2 T/2 ro 
(T)=]_ kako Fe (1) Fo (1) |S (t-te) dt+ | keFe(t) i (t—t,) it, (tT, —T, )dt 
T/2 bs T/2 is 
J pokoFo (1) Sr (t—Te fig (1) dr-+] nti (t) Mp (t Tp ~ Tp) dt 


(15) 


Obviously, we have Ey. [ 5(T) =0. For the purpose of comparison, we also assume 


that signal amplitude is A 


E,[ 5(T) ]=kpkpA? ie F, (t) F, (t)de. (16) 
~ N, pT/2 Ny 2 
Vatys | 5(7)]= ke A” > as Fe (t Jar aga? 2 [" F(t (jare( 2 ; | Fe (17) 
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Substitute F(t) and F, (¢) into Eq.(17), we have 


22 447? 2772 mdO)~ pri2 27? 
Keka’ exp(-2x1.3907 /T Jar( 92) J oexp(-2x1.390° 17° at 
SNR = 5 
N 242) 1/2 2 2 242 nmd@ T/2 2 2 N, 
“aA I, ex (-2X1.390° (7?) dt-+ kp (=) J jaXP(-2x1.390° 17?) dr 7 
(18) 


4.4 Quantity Analysis of SNR Loss 


According to the assumptions in Section 4.1, the average SNR direct channel and 
target channel can be expressed as follow 


2 -1 
(SNR), =kpA? mae) | Mog [e. exp(—2x1.397°/T? )dt (19) 
i oe | 2 -1/2 
(SNR) =a (" exp(—21.397? /T?) dt Nor : (20) 
rR” Jrp : 2 


Then, the SNR loss with respect to the ideal output of match filter will be 


T/2 2 2 
- J yo8xP(-2x1.391 /T°) dt (SNR), ; (SNR), es 
* (SNR) +(SNR),+1 (SNR) +(SNR), +1 


Ie exp(—1.39r?/T? )dr 


From Eq.(21), it is found that the SNR loss caused by the modulation of antenna de- 
pends on the average SNR in the coherent dwell time of the direct path and the target 
channel. 
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Fig. 2. Coherent integration loss V.S. the average SNR in the direct path 
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normalized pattern of antenna 
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Fig. 4. The curves of the SNR loss factors V.S. the visual angle 


Therefore, in order to improve the average SNR in each channel, we should make 
full use of range cells in every dwell time that contain effective target echo for cross- 
correlation processing. In general, the mismatch between the receivers’ noise band- 
width and signal’s the will make the average SNR deteriorate. Thus, it is desired to 
match the receivers’ noise bandwidth with the non-cooperative emitter’s signal band- 
width. Generally speaking, the higher is the average SNR in the direct path channel, 
the smaller is the SNR loss factor, as can be seen in Fig. 2. When the average SNR in 
the direct path channel is fixed, the SNR loss will increase if the average SNR in the 
target channel increasing gradually. In practice, the actual SNR in the target channel 
is much smaller than 0 dB. If the average SNR in direct path is greater than 15 dB, the 
SNR loss caused by antenna modulation can be ignored approximately. 

Fig.3 shows that a normalized field lobe pattern for the aperture d=3m 
and A = 0.03 m. Let the SNR of the direct path received via its main lobe illumination 
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be 20 dB. If the direct path signal received from different sidelobes is adopted as 
reference signal of cross correlation processing, then the SNR loss is illustrated in 
Fig. 4. According to the curves of SNR loss V.S.@, , the SNR losses deteriorate rapid- 


ly as the visual angle becomes larger gradually. In fact, the average SNR in the direct 
path declines sharply as the level of the sidelobes are decreasing quickly. Therefore, 
only the signals intercepted from relatively higher sidelobes can fulfill the target 
detection. 


5 Conclusion 


The modulation phenomenon caused by the rotating of transmit antenna was illustrated 
in detail. The general expression of normalized filed pattern for widely used antenna 
was introduced for the purpose of quantity analysis. The direct-path signal and target 
echo modulated by the transmit antenna, were described in detail. Mathematical repre- 
sentations for direct-path signal and target echo were proposed with respect to the 
analytical expression of antenna’s pattern. Then, the SNR loss of the coherent integra- 
tion via cross-correlation processing was derived. The relationship between SNR loss 
and SNRs of direct-path channel and target echo channel was evaluated by simulation. 
According to the simulation result, it is found that the SNR loss caused by antenna 
modulation can be ignored if the average SNR in direct path is greater than 15 dB. 
However, the SNR losses deteriorate rapidly if the visual angle becomes larger gradu- 
ally. In order to increase the average SNR, it is also desired to match the IF receivers’ 
noise bandwidth with the non-cooperative emitter’s signal bandwidth. 
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Abstract. Results in theory of compressive sensing enable the reconstruction of 
sparse signals from a small set of non-adaptive linear measurements by solving 
a convex optimization problem. Considering the sparse structure of actual target 
space in Ground Penetrating Radar (GPR) application, a data acquisition me- 
thod based on random aperture compressive sensing (RACS) is studied in this 
paper, which requires the GPR transceiver to record only a minimum amount of 
samples through incoherent measurement at each aperture point, and only to 
measure a small number of random apertures in x-y plane of interested target 
space. Results indicate that the method allow much fewer sampling data. 


Keywords: compressive sensing; GPR; random aperture; data acquisition. 


1 Introduction 


The efficient utilization of GPR system for above civil and military application areas 
nondestructive speculative searches not only lies on the performance of GPR hard- 
ware system, but also rests on the efficiency of GPR subsurface data acquisition, 
imaging and feature detection method [1][2][3][4]. Familiar GPR method such as 
Range Migration (RM) Method [5], Reverse Time Migration (RTM) Method [6] and 
time-domain Standard Back Projection (SBP) Method [7] [8]require fine spatial sam- 
pling and Nyquist rate time sampling of the received signals, or a high aperture densi- 
ty measurement, and then they perform matched filtering with the impulse response of 
the data acquisition process. They don't use any prior knowledge about the target 
space, such as the spatial sparsity of targets. 

Compressive Sensing involves taking a relatively small number of non-traditional 
samples in the form of randomized projections that are capable of capturing the most 
salient information in original signal [9] [10]. In this paper we study data acquisition 
method for impulse Ground Penetrating Radars which based on random aperture 
compressive sensing by exploiting structure sparseness of the target space. First we 
give an introduction of compressive sensing theory. Then we present random aperture 
compressive sensing method and verify it's validity. 
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2 Compressive Sensing 


Consider a real-valued, one-dimensional, discrete-time signal x, which can be viewed 
as an Nx1 column vector in RN with elements x[n], n=1,2,...,N. Any signal in RN can 
be represented in terms of a basis of Nx1 vectors {y,}",. Using the NxN basis matrix 
y=[w]ly,]...[vy] with the vectors {y,} as columns, a signal x can be expressed as [11] 
[12] 

N 


x=) sy, or x=Vs (1) 


i=l 
where s is the Nx! column vector of weighting coefficients s, = (x, '¥;) = "x and 
-” denotes transposition. 

In common data acquisition systems, the full N-sample signal x is acquired, and 
the complete set of transform coefficients s is computed via s=‘¥’x , then the K 
largest coefficients are located and the (N-K) smallest coefficients are discarded. 
Compressive sensing acquires a compressed signal representation without going 
through the intermediate stage of acquiring N samples [13] [14]. 

Consider a general linear measurement process that computes M<N inner products 
between x and a collection of vectors {¢}", as in y, =(x,®,). Arrange the measure- 


ments y; in an Mx1 vector y and the measurement vectors ¢/ as rows in an MxN 
matrix ®. Then, by substituting ¥ from (1), y can be written as 
y=®x=0Ws (2) 


Compressive sensing theory indicate that if the matrix ®Y has the Restricted Isome- 
try Property (RIP) [15] [16] then it is possible to recover s exactly from O(Klog(N)) 
measurements by solving the following /;-norm minimization problem 


min |S St. y=OWS (3) 


Compressive sensing now has found a variety of interesting applications [17] [18]. 
This paper explains the application of Compressive sensing to data acquisition in 
GPR. 


3 Random Aperture Compressive Sensing Method 


For the i" aperture point, the discrete data GPR sampled and received can be written 
as [19] [20] 


R =| RGR DR EH| (4) 


Where F’ is the sampling frequency, 1, is the initial time of received i" aperture point 


signal, and N, is the number of temporal samples. Usually, F, is large in order to 
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afford adequate sampling data for existing algorithm, such as RM, RTM, SBP algo- 
rithm to implement high-resolution subsurface imaging, and all aperture points in x-y 
plane of target space need to be measured. Based on compressive sensing, a signal 
sampling and recovery theory, random aperture compressive sensing algorithm 
(RACS) record only a minimum amount of samples through incoherent measurement 
at each aperture point, and chosen to measure only a small number of random aper- 
tures in x-y plane of interested target space, RACS utilize the acquired small amount 
of random aperture and incoherent measurement data to reconstruct the original target 
space, the random aperture compressive sensing data acquisition process can be 
presented as 


D, =®R, = ®P b (5) 


Where ® is an MxN, measurement matrix and M_ N,, D, is the “x1 compressive 


sensing acquisition data at i” aperture point. 
By solving an /;—norm minimization problem 


min || St. D, = ®Y,b (6) 


The target space indicator vector at i" aperture point can be reconstructed [21], and 
the original target space can be obtained by add all indicator vectors of chosen ran- 
dom aperture point. 


4 Simulation Results 


In simulation we set N,=512 and sampling interval to be10ps , width of Gaussian 
pulse w=80ps. Virtual target space is shown in fig.1. The discrete samples of all aper- 
ture points in x-y plane are obtained according to formula (4). Instead of measuring 
all the space-time domain response at each aperture position, RACS form 20 inner 
product measurements at 32 randomly chosen aperture points making 640 measure- 
ments in total. The inner products is the product of time-domain response with® , 
which of size 20x512 with all entries drawn independently form 4(0,1/J512) . 

The sparsity pattern vector for virtual space has length 2560 and there are 640 
measurements which result in an underdetermined equation, D=>®R=®¥b . Least 
squares method provide a possible solution b=(®¥)'(@¥(@P)")D , the target space 
image for this is shown in fig.2 for compare. SBP using all of 512x256 space-time 
data, and do matched filtering of the measured data with the impulse response of the 
data acquisition process for each spatial location, and then add up the amplitude value 
of received signal at all aperture points with same round-trip delay value to obtain 
target space indicator vector and reconstruct target space, which result is shown in 
fig.3. 

RACS utilizes 20 measurements of each aperture point, and measure 32 random 
aperture points to recover target space indicator vector, for the numerical solution of 
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(6) a convex optimization package called /,-magic [22] is used, the target space image 
by RACS is shown in fig. 4. 

It can be seen that the actual target positions are found correctly by RACS me- 
thod, and the obtained image is much sparser compared to the SBP result in fig.3, 
even though the SBP result is obtained using all of the space-time data, since RACS 
exploits sparseness characteristic of the target space. The convex optimization result 


has less side lobe in the image since the problem (6) forces a sparse solution through 
the 11-norm minimization. 


Slices of Virtural Tee Space 


B(5,10,5) cae 


B(5.6.8) 


B(10,5,7) 
Aperture Number 


Scan Line 


Fig. 1. Slices of Virtual Target Space. 


Result Slices of Least Squares Method 


Depth 


-20 


-25, 


Aperture Number Scan Line 


Fig. 2. Result Slices of Least Squares Methods. 
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Result Slices of SBP Algorithm 


Aperture Number Scan Line 


Fig. 3. Result Slices of SBP Algorithm. 


Result Slices of Proposed RACS Algorithm 


Depth 


Aperture Number Scan Line 


Fig. 4. Result Slices of Proposed RACS Algorithms. 


5 Conclusion 


None existing subsurface data acquisition method uses the spatial sparsity prior know- 
ledge about the target space, and they need the GPR transceiver to work at high sam- 
pling frequency, high measurement aperture density in order to provide there enough 
data to practice subsurface imaging. In this paper we studied a data acquisition me- 
thod for impulse GPR based on random aperture compressive sensing by exploiting 
sparseness in the target space, RACS method record only a minimum number of sam- 
ples through incoherent measurement at each aperture point, and chosen to measure 
only a small number of random apertures in x-y plane of interested target space, 
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which proved to be much more robust to noise and fewer measurements needed when 
compared to existing subsurface data acquisition procedure. 
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Abstract. The convergence speed of algebraic reconstruction technique (ART) 
depends heavily on the order in which the projections are considered. In this 
study, a projection access scheme based on prime number increment is pro- 
posed, which is applicable to uniform projection sampling in any angle range. 
We compared the results reconstructed from the proposed method with the re- 
sults reconstructed from the conventional sequential method, the prime number 
decomposition method and random ordering method, for cone-beam X-ray 
computed tomography reconstruction and for the case of circular acquisition. 
The results indicate that using the proposed method can accelerate the conver- 
gence of ART and produces more accurate images with fewer artifacts. 


Keywords: CT reconstruction, ART, projection order, prime number 
increment. 


1 Introduction 


Cone-beam x-ray computed tomography (CT) is one of the most important non- 
invasive imaging techniques. In the x-ray CT reconstruction, a volumetric image of 
object is reconstructed from the projection data. There are two major categories of CT 
image reconstruction: analytic and iterative methods. Iterative methods such as alge- 
braic reconstruction technique (ART) [1] and expectation maximization (EM)[2-3] are 
superior to analytic methods such as the FDK [4] and the Katsevich algorithms [5] in 
handling incomplete and noisy projection data. 

A relatively high demand for computational time is the main drawback to use itera- 
tive methods. Several approaches have been developed to accelerate the computation 
of iterative methods. One method is using the optimized ordering schemes for ART. 
Several methods have been proposed and evaluated so far [6—9]. 

The aim of this study is to design a “good” permutation of the ordering of the pro- 
jection view’s index using the simple method. This paper is organized as follows. 
A brief review of ART and the proposed projection access system based on prime 
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number increment are given in Section 2. Section 3 describes the experiments, quantit- 
ative evaluation metrics, and the results. The conclusion is given in section 4. 


2 Methods 


2.1 Algebraic Reconstruction Techniques 


The image reconstruction problem in CT can be modeled by the following equation: 
AX =Y () 


where Y =(y,,¥o5°°'s Yy )! is the projection data and J is the total number of projec- 


tion rays, X = (X1,Xy,°°°X, yr is an unknown image and J is the total number of the 
voxel in the image, and A = (a,;);,.7 is the projection matrix and aj; is the length of 


projection ray i through voxel j. The problem is to reconstruct the X from the Y . 
The ART algorithm provides an efficient iterative way to solve the problem. It can 
be written as: 


3 
(k+l) _ _(k) j=l 
Xj =X; +A, —. (2) 


where i=k mod(/)+1 and A, is the relaxation parameter. This method was origi- 


nally discovered by Kaczmarz in [10]. 
2.2 A projection Access System Based on Prime Number Increment 
Denoting M as the total number of projections and the projections P,,0<i< M —1, are 


20 
assumed to be equally spaced by an angle g= va in the interval [0,27). In the fol- 


lowing, we will give the permutation t of the ordering of the M projections 
{1,2,---,M} using prime number increment: 


tT) =(tG-1)+P)mod(M) 1sisM-1 


3 
T(0) =0 Si 


where P(< M) is the prime number and the P is not divided exactly by the M. 
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Note also that in the special case when M is a prime number, P(< M) can be any 


positive integer. The sampling pattern is non-periodic because M is not divided exact- 
ly by the P . This implies that all measured projections index is processed during the 
iteration. 


3 Evaluations 


3.1 Phantom and Simulation Setup 


The 3D 128x128x128 voxels Shepp-Logan model is selected as the test model. The 
three central slices are shown in Fig. 1. 


(a) X=0 (b) Y=0 (c) Z=0 


Fig. 1. The central slices of 3D Shepp-Logan model. 


We use a circular orbit to acquire projection views[F), F,,---, Py_;]. System di- 
mensions were as follows: source-to-detector distance equal to 1,000 pixels, source- 
to-centre-of-rotation distance equal to 200 pixels. The detector consisted of 160 x 
160 detection channels. To eliminate the effects of another variable in the compari- 
son process, we used a fixed value of A, =0.2 throughout the reconstruction 
procedure. 

To verify the validity of the proposed method, projection data comprise 360 pro- 
jection views distributed uniformly in angle [0, 27], and ART using Prime Number 
Increment (PNI) is compared with ART using Sequential Access (SAS), Prime Num- 
ber Decomposition (PND) and Random Access (RAS). 


3.2 Assessment of Image Quality 


Both qualitative and quantitative evaluation methods are used in this work. Qualita- 
tive evaluation involves visual comparison of different reconstruction methods. Quan- 
titative evaluation involves calculation of the normalized mean square error 
(NRMSE) and the correlation coefficient (CC). 
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The NRMSE is defined as 


J 
Ds (x; — X; \ 


NRMSE =| | , (4) 
Y (4; -x)” 
i=l 


where x, refers to the i th voxel value in the true image , a represents the value of 


voxel i in the reconstructed image, X represent image average value in the true 
images. 
The CC is defined as 


J 
by (x; —Xx)(K; — x) 


O__ = (5) 


J J 2 
YD (aj — 3° Y (& - 0)? 
i=l i=l 


where X represent image average value in the reconstructed images. The CC measures 
the extent to which two images are similar to each other and it takes the highest value 
of unity if the two are exactly the same. 


3.3. Results 


Fig. 2 gives the plot of NRMSE as a function of the prime number after | iteration 
when the 360 projection views are distributed uniformly in angle [0, 27). In fig. 2, the 
upper and the nether beeline denote the NRMSE reconstructed from PND 
(NRMSE=0.1996) and RAS (NRMSE=0.1980) after 1 iteration. From fig. 2, we can 
see the NRMSE reconstructed from PNI after 1 iteration is less than the NRMSE 
reconstructed from RAS, when P=11, 13, 17, 29, 31, 37, 41, 53, 71, 73, 101, 109, 113, 
139, 157, 167, 199, 223, 227, 239, 241, 251, 307, 331, 347, 349. 

In the following, we compared the PNI (P=157) with SAS, PND and RAS. Table1l 
and table 2 list the NRMSE and the CC reconstructed from PNI (P=157), PND and 
RAS after 5 iterations. From table 1 and table 2, we can see the optimized projection 
ordering systems (PND, RAS and PNI) are better than conventional sequential me- 
thod (SAS) and the PNI is slightly better than PND and RAS in convergence speed. 

Reconstruction results for three central axial slices after three updates are shown 
in Fig. 3. The graph clearly shows differences in intensity between the images gener- 
ated by SAS and the optimized projection ordering systems. However, the differenc- 
es in intensity between the images generated by PND, RAS and IWDS are almost 
invisible. 
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Fig. 2. Plots of the NRMSE versus the prime number after | iteration. 


Table 1. The NRMSE reconstructed from projection access systems(SAS,PND,RAS,PND) after 
5 iteration 


Iterations SAS PND RAS PNI 
(P=157) 

1 0.304499 0.199631 0.198078 0.197382 

2 0.164394 0.120987 0.120266 0.119865 

3 0.109118 0.086962 0.086848 0.086393 

4 0.081984 0.068216 0.068468 0.067936 

5 0.066087 0.056454 0.056923 0.056333 


Table 2. The CC reconstructed from projection access systems(SAS,PND,RAS,PNI) after 5 
iteration 


Iterations SAS PND RAS PNI 
(P=157) 

1 0.948793 0.975845 0.976484 0.976613 

2 0.980074 0.988581 0.98889 0.988959 

3 0.988645 0.992707 0.992862 0.993014 

4 0.992525 0.994706 0.994688 0.994819 

5 0.994619 0.995840 0.995788 0.995933 
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PNI (P=157) 


Fig. 3. Reconstruction results at the central location of three different axes after three iterations 
using different projection access systems. 


4 Conclusion 


In this paper, we introduced a new and simple projection access system PNI to accele- 
rate the ART algorithm and improve the quality of the reconstructed image. This me- 
thod is still adapted to the case that the projection views are distributed in limited an- 
gle. We analyzed the validity of the proposed method for cone-beam circular orbit CT. 
The PNI method is superior to SAS and slightly better than PND and RAS in conver- 
gence speed and reconstructed image quality. Results have been obtained for ART 
only, but it is anticipated that other iterative reconstruction algorithms like OS-type 
methods [11-15] will behave similarly. 

Mueller et al. also evaluated a method with constant angular increment in [9]. How- 
ever, their choices of 66.0°, 69.75°, 73.8° may be result in repeat projection sampling 
and differ from the method of this paper. 
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Abstract. In order to solve the communication system among the noise interfe- 
rence problem, we use a new method, the neural network and adaptive fuzzy 
system to model the system. With a certain signal source and a random noise to 
the training, the simulation effectively deal with the noise reaching the purpose 
of no noise. 


Keywords: artificial neural network, MATLAB, model. 


1 Introduction 


As the development of the artificial neural network, it has got much better applied 
effect in many fields. The model which is made in nonlinear system is the more 
important applied direction in discern and control. The artificial neural network pro- 
vides a strong tool for the nonlinear system which bases on its nonlinear approach 
ability and its study ability. At the same time the fuzzy inference can get the good 
fuzzy model which bases the given data adjusting the parameter[1]. This paper bases 
on the knowledge express ability and the neural network study ability in the fuzzy 
system and builds the ANFIS(Adaptive neuron-fuzzy system)[2], and to build a 
model of the nonlinear eliminates the noise system and to get the purpose of elimi- 
nating the noise. 


2 Neural Network PID Controller 


Neural network with learning ability and approach any nonlinear mapping ability, and 
thus resolve the uncertainty in the control of complex systems with very large applica- 
tions. Domestic and foreign scholars in recent years, neural network and traditional 
technology, applied to nonlinear system control aspects of a number of useful attempt, 
achieved some encouraging results. How to proceed from the classical PID control 
theory with the PID weights in the form proposed by the network structure to form 
complex neural network PID controller, both of neural networks in order to avoid 
possible local minimum, the establishment of a hybrid neural direct adaptive control 
structure and the corresponding learning algorithms, and achieved a good control 
effect. 
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2.1 Robust Adaptive Neural Network Controller 


Unknown system for the model proposes a complex control structure - the parallel 
self-learning neural network Robust adaptive control structure, it can use the learning 
ability of neural networks and nonlinear mapping capabilities to address the tradition- 
al online model of adaptive control online identification and controller design prob- 
lem for uncertain nonlinear systems in order to achieve high precision output tracking 
control; run the monitor through the introduction, neural network control method is 
usually overcome the existing problem of poor real time; use of a robust feedback 
controller to ensure the neural network model for studying the stability of the initial 
closed-loop system. Two methods are proposed by a conventional feedback controller 
to guarantee system stability. Therefore, when the neural network controller is de- 
signed with larger degree of freedom, it is better than the pure control of intelligent 
control performance. 


2.2. Study of Neural Network Optimal Controller 


Optimal control of linear and nonlinear neural network learning methods combined, a 
new complex nonlinear optimal controller. This control method with artificial neural 
networks in parallel, adaptive, self-learning ability to apply the present optimal con- 
trol, control system of compensation as part of completion of more accurate modeling 
and stability control, the control system has more advanced intelligence, is a very 
effective combination of methods. 


3 The System Model 


The noise eliminated may be look as a reflection from the noise space to the no noise 
space. Generally, the reflection is a complicated nonlinear function, the fuzzy system 
may be realize the complicated nonlinear reflection in this neural network, and makes 
it adapt the surrounding change and make it have the most robust. It can be used to 
the complicated analogy nonlinear system. It may be make up a completed noise 
eliminated system. The noise eliminated is to get rid of the noise from the advanced 
tackle and only restore the goal signal, using the neural network fuzzy system to com- 
plete the network train model[3]. After the network stabilized, it may be complete the 
noise eliminated. The number M which the network may be remembered is deter- 
mined by the neural element. The model of system is in Fig 1. 


M 
: a 


——_——— + The folded noise F 


Fig. 1. The folded noise measure process 
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In Fig1, I is the signal which is going to be measured, N is the noise source, D is 
the noise signal which is folded the measure signal M. D and N satisfied the nonlinear 
reflection f. As follows: 


D(k) =f(N(k),N(k-1),....) (1) 
M(k)=I(k)+D(k)=I(kK)+f(N(K),N(k-1),....) (2) 


4 System Simulation 


For example: the input and noise signal are X and nl respectively. Their curves are in 
Fig. 2 (a) and (b). 


/ \ 
| | 
\ 
cr 1 a! 


(a) the input signal 
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Fig. 2. The input and noise signal 
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If the folded measure noise signal n2(k) is as the Fig 3 
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Fig. 3. The folded noise 
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Fig. 4. The measured folded signal 


The measure signal M of the folded noise n2(k) satisfies the following relation: 
M(k) =X(k)+ n2(k) (3) 


The measure signal change curve as the following Fig 4. 

Choosing the two input and output neural vague system to simulate the nonlinear 
character and gains the input —output data (n1(k),n1(k-1), m(k)), using the ANFIS 
function to train the neural network fuzzy system. In training, every input variable 
subordinates the function number chooses 2, and the total fuzzy rule number chooses 
4, As the following, use the MATLAB sentence to carry out the training process. 

To eliminate the noise of the measure signal, if m1 is the measure signal which the 
system has eliminated the noise: the measure signal curve as Fig 5. 
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Fig. 5. Measurement of the signal after noise cancellation M 


By comparing the input to output, the signal source passes the ANFIS model, the 
output signal is the same as the input I, and it effectively eliminates the noise and 
gains the purpose of the noise eliminated. 


5 The Conclusion 


Given above in the context of non-Gaussian noise neural network adaptive fuzzy 
systems nonlinear processing, for the noisy source, by the neural network model of 
fuzzy system consisting of system and noise elimination in real time performance can 
meet the requirement. The nonlinear adaptive fuzzy neural network intelligent control 
strategy and the traditional control structure combining intelligent composite control- 
ler, whichever is in control of the advantages and features, has become a hot topic in 
the field of control, but also part of the solution types control of complex systems an 
important tool in both theory and practical application is very important. 
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Abstract. To deal with two major problems of electric vehicle (EV): the short 
driving range and the short life of batteries, a hybrid power system was de- 
signed and applied to the EV. It was composed of an ultracapacitor with high- 
specific power and long life, four lead-acid batteries with high-specific energy, 
and a bi-directional DC/DC converter. To improve the stability and reliability 
of the system, based on researching driving process and 4 synthesis robust con- 
trol, the driving mathematical model of the system was established, and the 
driving w synthesis robust controller for the system was designed. The simula- 
tion experimental results show that the hybrid power could not only increase the 
starting current, but also reduce the batteries’ discharge current and lengthen the 
life of batteries. Additionally, the ~ synthesis robust controller is superior to 
PID controller at response speed, steady-state tracking error and resisting 
disturbance. 


Keywords: Electric vehicle, Ultracapacitor, Hybrid power, Driving, Robust 
control. 


1 Introduction 


Due to the dual pressure from environmental pollution and energy crisis, it has be- 
come a general trend of the development of electric vehicles (EVs). EVs have im- 
proved their performance and made suitable for commercial and domestic use during 
the last decades. Nevertheless, they still have not achieved ranges as good as gas- 
powered conventional vehicles. This problem, due to the low specific energy 
contained in most electric batteries compared to that of gasoline, restricts EVs fast 
developing. It is very significant for the increase in range that the technology of ener- 
gy-regenerative baking is applied to EVs [1]. However, batteries have a poor ability to 
recover energy from a regenerative braking, and a scarce power capacity for a fast 
acceleration. For this reason, EVs may use an auxiliary energy system able to receive 
regeneration fast and give power during peak periods. The ultracapacitor with high- 
specific power looks as the most appropriate choice [2]. In this paper, a hybrid power 
system, in which batteries are in use as “main source” and ultracapacitor as “auxiliary 
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source”, was designed and applied to the EV. It is essential the development of an 
energy-management system to control the power flow between both sources. In order 
to improve the energy-regenerative efficiency, the optimal-control strategy is very 
important and should be researched. yu synthesis robust controller has strong robust- 
ness. In this paper, based on establishing the mathematical model of the driving 
process, a w synthesis robust controller for the system was developed and tested suc- 
cessfully. The hybrid power system controlled by uw synthesis robust controller can 
recover more energy and lengthen batteries’ life. 


2 Driving Process and Its Mathematical Model of Hybrid-Power 
EV 


In this paper, the EV employed a brushless DC motor (BLDCM), which has been 
widely applied in the field of EVs, due to a series of advantages: simple structure, 
reliable performance, high efficiency, large starting torque, etc [3]. The main circuit 
topology of the EV control system designed in this paper is shown in Fig. 1. 

During the start-up process, the batteries and the ultracapacitor drive the motor 
together. When EV runs at normal speed, batteries drive the motor alone, and mean- 
while recharge some energy to the ultracapacitor to prepare for accelerating or climb- 
ing. The principle of ultracapacitor driving motor is the same as batteries’, so we take 


batteries as an example to analyze the driving process and establish the mathematical 
model [4-7]. 
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Fig. 1. Hybrid-power EV’s main circuit topology. 


In this paper, the three-phase BLDCM works according to six states, that is, T1 to 
TO6 of the inverter work in turn according to the six states of Hall sequence, and in any 
state, two-phase work principle is the same. The BLDCM could be seen as the DC 
motor under the condition of two phases conducting. In this, we take phases A and B 
as an example to establish the driving equivalent circuit. During the driving process, 
the batteries drive the BLDCM by buck converter, at this time, Tl is PWM, T4, T9 
are on, and other MOSFETs are off, the current flow direction and equivalent circuit 
topology as buck converter are shown in Fig. 2. 
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Fig. 2. Equivalent circuit when batteries alone. 


The driving mathematical model can be established as follows (take Fig. 2 as an 
example): 

While T1 is in conducting state and cut-off state, the state equations [8] are respec- 
tively described as: 

Tl_on(O<St<d-T) 


di 
(L,+L,) - =v,-e,—-¢,—-i, (Gr+r,+n+n,,). (1) 
Tl_off(d-T<t<T) 
di : 
Oe ar im a ae ae (2) 
t 
where d is the duty cycle of T1_PWM, T is the operation period, 7,,, is the ESR of 


the batteries. 
Because the motor’s three phases are similar, in permitted scope, we have: 


L, =L, =D» €: =& =n U =% =",,, then (1) and (2) can be simplified as: 
di, ; 
2L,, ar =v, —2e, -i,°(3rn+2r, +h,,)> (3) 
di, : 
BE pe eos ae a ele a) (4) 
t 


According to the deflecting couple equilibrium, we have: 


d 
Se K in Ts (5) 
re (6) 


where J is the moment of inertia, @ is the rotational speed of motor, K, is the tor- 
que coefficient, J, is the load torque, and K, is the BEMF coefficient of armature 
winding. 
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: : T , : 
Suppose state variables are vali, o| , output is y=i, , after average 


processing, then: 


2r,+r+(2r+7,,):d+r,-(-d) K, v, d 
, 2L, L,, 2L,, 
w= x+ ; 
x, ot |e alle (7) 
J J 


y=[1 0]x 


After handling by perturbation method, and separation of steady-state variables and 
instant variables, we can get the linear small-signal mathematical model of driving: 


2r, +7,+(27 +4,,,)°>D+r,:U-D) K 


7 = 1 20 Nat Yp 
+1 _“ 0 
x= 2Ln En x+ 2L, -X-d+|2L, |-d 
ca: 0 0 0 0 (8) 
J 


y=[l O]x > 


where D and X are respectively the circuit steady values of d and x. 
When the batteries and the ultracapacitor drive the motor together, the current flow 
direction and equivalent circuit topology are shown in Fig. 3. 


Fig. 3. Equivalent circuit when batteries and ultracapacitor together. 


3 Driving » Synthesis Robust Controller Design for Hybrid-Power 
EV 


3.1 Equivalent Transformation of Control Problem 


The internal link structure of EV driving control system was shown in Fig. 4. The 
figure reflects the connection relations of nominal model Go, feedback structure Ka, 
uncertainty model, and performance index. In the modelling process, all uncertainty 
belong to the normalization transfer matrix 4, which describes the difference be- 
tween nominal model Go and the actual model G. Moreover, 4 is stable, and its norm 


boundary condition is ||Al|_. <1. 
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Fig. 4. The EV driving control system with the uncertainty (z - weighted control, p — perturba- 
tion, u - control variable, y - measurement input, d — disturbance, e - performance output). 


control of the system can be described as follows: for all the stable perturbation 
A, and |||, <1, finding a stable controller Kd that enabled the closed-loop system 


still to maintain stable in the perturbation situation. the weighting sensitivity transfer 
matrix is §(A) =W(1+G,(I +AW,,)K,) W, and has \|S(4)||. <1. The control problem 


in Fig. 4 can be equivalently transformed into a typical ~ design problem, and the 
transformation process is shown in Fig. 5. 
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Fig. 5. Equivalent transformation of control problem. (a) Equivalent connection diagram of 
open-loop control system, (b) Equivalent connection diagram of closed-loop control system 
after adding K,, (c) Closed-loop linear fractional transformation. 


3.2 The Selection of Weighting Function 


The choice of weighting function is the key to design the system controller, and di- 
rectly determines the performance of the system. Through the rational choice of 
weighting function, it can be guaranteed that the system has strong robust stability, 
good capacity of command signal tracking, anti-interference and noise suppression. 
Some literature works gave a few weighting function selection methods for some 
specific system [9][10], but due to the different characteristics of each system, it is 
difficult to give a general design method of weighting function. The main difficulty of 
weighting function selection lies in how the system closed-loop performance index 
was expressed in the weight function with as far as possible simple structure. The 
weighting function mainly includes performance index weighting function, model 
uncertainty weighting function, and input weighting function. 

According to the actual situation of the EV in the paper, after several adjustments, 
the frequency weighting functions of EV driving controller were taken as follows: 

Performance index weighting function is 


0.38 +100 


Ww eee 
($) = 0 005 


(9) 
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Model uncertainty weighting function is 


3(s +10) 
W,.1(S) =————_ 10 
we) =~ 150 2 
Input weighting function is 
s+10 
W,(s)= : 11 
a(S) 150 (11) 


3.3 yu Synthesis Robust Controller Solving Process 


According to the driving mathematical model described above, and the weighting 
functions from (9) to (11), uw synthesis robust controller can be optimally solved by the 
D-K algorithm in u-Tools of Matlab Robust Control Toolbox [11]. As a result, the 
structure singular value w=0.768<1, and closed-loop system achieved robust perfor- 
mance requirements. After handling by truncated equilibrium model, the synthesis 
robust controller reduced to 4 bands as follows: 


0.0312s° + 5.218657 +117.65695 + 19.3253 


4 3 2 , (12) 
Ss’ +256.68s° + 202.5034s° +35.215s +0.67 


K,(s)= 


In the practical application, (12) must be discretized to realize digital programming. 
In this paper, the control period was 7,=2 ms. The discrete controller is gotten as 
follows by the double linear transformation method: 


(2.907z* — 4.9462? — 0.83042" + 4.946z—2.076)-10°_ 
z* 3.5912? + 4.7732" —2.774z + 0.5916 


Ky(z)= (13) 


4 Simulation Results and Analyses 


The controller of hybrid-power EV designed in this paper was tested by MAT- 
LAB/Simulink. The hybrid power system uses a 3.125 farad ultracapacitor and four 
batteries with a nominal voltage of 48 volts. The driving simulation comparisons were 
respectively done with the PID controller and the w synthesis robust controller. The 
simulation results are shown as Fig. 6 and Fig. 7. 

Fig. 6 shows the response of battery current and ultracapacitor current when the 
load of EV was suddenly increased. When the driving current of EV was not big, the 
ultracapacitor stopped discharging. If the load was suddenly increased, the driving 
current of EV would be also increased. When the driving current exceeded 9 amperes, 
the ultracapacitor began to discharge, and worked together with the batteries. As 
shown in Fig. 6, the ultracapacitor could reduce the batteries’s discharging current. 
Additionally, the 4“ synthesis robust controller is better than the PID controller at re- 
sponse speed and steady-state tracking error. 
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Fig. 6. Current response when the load was suddenly increased. 


In order to compare the robustness of two controllers, the disturbance signal with 
2-ampere amplitude and 50-ms pulse width was imposed on the system at 2.5 
seconds, and the simulation results are shown in Fig. 7. As we see, the yw synthesis 
robust controller is superior to PID controller at response speed and resisting distur- 
bance. 
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Fig. 7. Resisting-disturbance comparison. 


5 Conclusions 


Based on constructing the control system of the hybrid-power EV, a 4 synthesis ro- 
bust controller was designed and applied to the EV. The following conclusions can be 
gotten from the above simulations: 

The ultracapacitor-battery hybrid power system with the w synthesis robust control- 
ler can enhance the instantaneous performance of EV when driving, avoid batteries 
being charged and discharging by big electric current, reduce the batteries’ charge 
times, and lengthen the batteries’ life. In the driving processes, comparing with tradi- 
tional PID controller, the “ synthesis robust controller has better robustness, and can 
improve the stability and reliability of the system. 
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Abstract. In order to promote the application of electroactive polymer, the ge- 
nerator mode theory fundamentals of electroactive polymer (take acrylic elas- 
tomer as an example) were discussed and studied. The power generation mode 
of electroactive acrylic elastomer (EAE) was studied by experiments based on 
designing a power generation measurement system for EAE. The power genera- 
tion process of EAE was tested under different influencing factors. The experi- 
mental results show that power generated when EAE relaxed. The bias voltage 
and pre-strain level had a great impact on the power generation of EAE. The 
voltage of power generation increased with the augmentation of the bias voltage 
and pre-strain level. 


Keywords: Electroactive polymer, Power Generation mode, Influencing 
factors, Bias voltage. 


1 Introduction 


Electroactive polymer (EAP) is a new type of polymer which responds to external 
electrical stimulation. Electroactive acrylic elastomer (EAE) is a particular type of 
EAPs and it has best demonstrated exceptional performance. This type of field- 
activated EAPs has shown tremendous promise for applications as actuators. Their 
advantages in converting mechanical to electrical energy in a generator mode are less 
well known [1]. EAE is formed by infiltering or coating the acrylic elastomer matrix 
with electrode material in the upper and lower surface based on Maxwell effect [2]. 
EAE could be viewed as a compliant capacitor, that is, a sandwich structure that the 
thin passive elastomer is sandwiched between two compliant electrodes. Compared 
with other types of EAPs, EAE could produce bigger strain, and have better flexibility, 
lower density, and lower cost, etc [3][4]. This material could be used as the driver 
material, the bionic muscles, and the related components used to construct micro elec- 
tromechanical system (MEMS), such as micro-motors, micro-pumps, micro-valves, 
etc, moreover, it could be used in power generation, especially suitable for distributed 
power generation [5-7]. Therefore, EAE will have broad application prospects. 
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EAE is capable of converting energy in the form of electric charge and voltage into 
different form of mechanical force and vice versa. In this paper, EAE was realised 
with a pre-stretched circular film coated with compliant electrodes of graphite powder 
and conductive adhesive. The mechanism and influencing factors of EAE power gen- 
eration were studied by theoretical analysis and experiments. The research on the 
EAE was expanded. The chief purpose of this study is to provide the basis for the 
design and use of the EAE power generation. 


2 Power Conversion Process of EAE 


In this paper, we employed EAE as EAP. Acrylic elastomers have demonstrated an 
estimated 0.4J/g specific energy density, compared to around 0.1J/g for advanced 
single crystal ceramics and around 0.04J/g for peak electromagnetic. Much higher 
energy densities, over 1J/g, are predicted. High conversion efficiency is predicted, 
theoretically up to 80-90%. In addition to superior performance, EAEs have two fea- 
tures that distinguish them from other energy-conversion materials: they are made 
from low cost materials that can be easily fabricated and they are compliant. 

There are many advantages [1] of EAE power, such as, lighter — low density, high 
performance, multifunctional polymers; cheaper — inexpensive materials, fewer parts, 
no precision machining; quieter — high energy density and compliance of polymers 
allows quiet primarily sub-acoustic operation with few moving parts; softer — rubbery 
materials are impedance matched to large motions (e.g. human motion, engines); 
versatile — polymers are scale-invariant; systems can be made in variety of form fac- 
tors (conformal, elongated, etc.). 

EAEs could convert electrical energy to mechanical work and vice versa, as shown 
in Fig. 1. 


ENERGY MECHANICAL WORK 


(ELECTRICAL) 


GENERATOR OR SENSOR 
Fig. 1. Power conversion process of EAE. 


EAEs are a type of EAP that uses an electric field across a rubbery dielectric with 
compliant electrodes. The basic functional element is shown in Fig. 2. Incompressible 
polymer gets thinner and stretch in area when a voltage is turned on. Incompressible 
polymer gets thicker and contracts in area when a voltage is turned off. 
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Compliant electrodes 
(on top and bottom 
surfaces) 


Voltage on 


Polymer film 
Voltage off 


Fig. 2. Basic functional element. 


Power generation principle of EAE is shown in Fig. 3. It is like a variable capacitor 
generator. Energy generated as nearly incompressible elastomer layers increase in 
area and decrease in thickness when stretched, and power generated when EAE 
relaxed. 


Compliant Electrodes (2) 


t+ oF tH HE pH, (lO) 


Electroactive Acrylic Elastomer Oe a 


EAE STRETCHED EAE RELAXED 


Fig. 3. Power generation principle diagram of EAE. 


3 Power Generation Measurement System Design 


Fig. 4 shows the power generation measurement system. The bias power supplies the 
voltage V, for EAE through a diode. Based on the bias voltage, the power generation 


was achieved by the EAE film contraction. The increased voltage in power generation 
process can be detected by a high-voltage probe (1000:1) and could be seen through 
an oscilloscope. 


(+ >} High-Voltage ( 1900:1) 
Bias re. 
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Power shore Voltage 
eatatte Measurement 
EAE Element 


Fig. 4. Measurement circuit. 
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4 Theoretical Analysis Model 


4.1 Strain Energy Function of Super-Elastic Materials 


In order to simplify the calculation in the theoretical analysis of acrylic elastomer, it 
could be assumed to be super-elastic material (Poisson’s ratio is 0.5) which could not 
be compressed, and possess uniformity. The mechanical property of super-elastic 
material is generally depicted by the general strain energy function W that may be 
expressed in various forms [8]. According to the analysis of simulation results in [9], 
Yeoh strain energy form was adopted in this paper. The strain energy form depends 
on /,, namely the first constant of the so-called left Cauchy-Green deformation tensor. 
The equation of the strain energy form is expressed as follows: 


W = C(I, -3) + Co (I, - 3)" + Cy F, - 3)’, (1) 


where, C,, C,,, C,, are the material parameters, and respectively C,, = 0.063 MPa, 


10? 20? 


C,, =—8.88x10~* MPa, one =16.7x10° MPa. J, was calculated in (2) through the 
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eigenvalue of deformation gradient tensor, namely strain rate 2, (i=1,2,3). 
[L=A? +A +a. (2) 


For incompressible materials, Cauchy principal stress (the force per unit area of de- 
formation structure) could be determined by A;: 


aga s Pe (3) 


where, p’ is the super-elastic force, and related with dynamic boundary conditions. 


4.2 Power Generation Principal Analyses 


According to [1], EAEs could be simplified as variable capacitors. The capacitance 
can be expressed as 


C=e€,Ald, (4) 


where € is the relative dielectric constant, E) is the permittivity of free space 


(8.85x10°' F/m), A is the total area of the polymer film, and d is the thickness. 
Both A and d depend on the strain. Because they are elastorners, the volume B of 
the EAE stays the same during stretching, that is, 


B=A-d. (5) 
From (4) and (5), have 
C=«,A7/B. (6) 
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The voltage V of EAE can be expressed as 


_Q_ QB 


C  6,A? 


= (7) 
where Q is the charge. 


Based on the theory of variable capacitor, the electrical energy generated by EAE 
can be deduced as follows: 


Suppose C, and C, are the capacitances of the EAE capacitor in the stretched and 


contracted states, respectively. In the stretched state, a bias voltage V, is applied to 


the EAE, and the same voltage V, is on the EAE capacitor in the contracted state 
after some amount of charge has drained through the drain resistor R. So have the 


stored electrical energy in the EAE capacitor in the stretched and contracted states 
respectively [1][10] as follows: 


1 1 
E,==V,,C,’ ==V,,Q, > 8 
Ss 9, nm ny 2 inQs ( ) 
feel 
E, =< —V,,C, = 5 Vine : (9) 


Suppose AQ is the charge that goes through the drain resistor, Q, and Q, are the 


stretched and contracted charges on the film, then have 
AQ =Q0,-Q.=[V(t)/R]dt . (10) 


Suppose E, is the energy dissipated through the drain resistor, E, = [V(t)?/ R]dt, 
and the electrical energy E, generated by the EAE can be deduced as follows: 


E, =E,-E, E,=5V,A0 (V(t)? /Rldt = SV, 1V ()/ Ride [V (0)* Ride (11) 


5 Experiments and Analyses 


The power generation process of EAE was tested by the experimental equipment and 
environment which comprises bias power, EAE element, high-voltage probe, and 
oscilloscope. The experimental results were recorded by the oscilloscope. 

The actuator and generator cycle process of EAE under different bias voltage were 
compared and shown in Fig. 5. In the experiments, the pre-strain was 300%, and the 
bias voltage was 1500 volts and 3000 volts respectively. When the EAE element was 
stretched and compressed, it worked in actuator mode and generator mode respective- 
ly. As shown in Fig. 5, EAE element can achieve a continuous power generation 
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in the stretched and compressed cycle process. Additionally, through contrast, it was 
found that with the bias voltage increasing, the amplitude of increased voltage and the 
cycle power generation level also increased. 
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Fig. 5. Actuator and generator cycle process. 


The pre-strain influence on the power generation level was tested and shown in 
Fig. 6. In the experiments, the bias voltage was 2000 volts, and the pre-strain was 
100% and 200% respectively. When pre-strain was 100%, the peak voltage reached 
about 2100 volts, increased about 100 volts. When pre-strain was 200%, the peak 
voltage reached about 2200 volts, increased about 200 volts. Through contrast, the 
result showed that the pre-strain level had a great influence on power generation, and 
with the pre-strain level increasing, the power generation level also increased. 
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Fig. 6. Power generation waveform vs. different pre-strain. 


6 Conclusions 


Based on theoretical analysis and experimental measurement for EAE, the following 


conclusions could be gotten: 
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When EAE relaxed, EAE became thicker and contracted in area, as a result, power 


generation was achieved. The bias voltage and pre-strain level had a great impact on 
the power generation of EAE. With the bias voltage and pre-strain level increasing, 
the power generation level also increased. In order to achieve the best result, the val- 
ues of the two factors could be maximized under the bearing capacity of the system. 
The experimental results accorded with analytical model and lay the foundation for 
generator and sensors of EAE. 
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Abstract. Gate-length biasing is an active leakage reduction technique that uti- 
lizes the short-channel effect by marginally increasing the gate-length of MOS 
devices to reduce their leakage current with a small delay penalty. DTCMOS 
(Dual-threshold CMOS) has been proven as an effective way to reduce sub- 
threshold leakage consumption in active mode. This paper proposes a 
gate-length biasing technique for clocked adiabatic logic (CAL) circuits using 
dual-threshold technique to reduce sub-threshold leakage dissipations. An im- 
proved CAL register file using DTCMOS gate-length biasing is addressed in 
this paper. All circuits are verified with HSPICE using a NCSU 45nm technol- 
ogy. The simulations show that the improved CAL register file with gate-length 
biasing and DTCMOS techniques can attain large energy savings. 


Keywords: Nanometer circuits, Gate-length biasing, Dual-threshold technique, 
Clocked adiabatic logic, Leakage reduction, Register file. 


1 Introduction 


The power dissipation is a key factor of limiting circuit performances and costs. In 
previous studies of low-power integrated circuits, the dynamic power consumption is 
the most concerns, and leakage dissipation is often neglected [1]. However, with the 
feature size of integrated circuits continues to reduce, the leakage dissipation caused 
by leakage currents catches up with the dynamic power consumption gradually and 
the standby power consumption is becoming an important factor in low-power design, 
which attracts extensive attentions [2]. 

A number of approaches have been proposed to reduce static leakage power when 
the system is in standby mode, such as VTCMOS or MTCMOS, input vector control, 
etc [3, 4]. Very few approaches, such as stacking transistor techniques [5, 6], dual 
threshold CMOS [7], and P-type CMOS design technology [8], have also been pro- 
posed mainly to reduce runtime leakages. 

For dual-threshold logic circuit, a higher threshold voltage can be assigned to some 
transistors on non-critical paths so as to reduce leakage current, while the perfor- 
mance is maintained due to the low threshold transistors in the critical paths. There- 
fore, no additional transistors are required, and both high performance and low power 
can be achieved simultaneously. 
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Gate-length biasing is an active leakage reduction technique that utilizes the short- 
channel effect by marginally increasing the gate-length of MOS devices to reduce 
their leakage current with a small delay penalty. Similar to the dual-threshold tech- 
nology, the gate-length biasing technology use longer gate-lengths transistor in non- 
critical paths to reduce leakage currents. 

Compared with conventional CMOS circuits, adiabatic circuits obtain low power 
dissipations, because they utilize AC power supplies to recycle the energy of node 
capacitances. In the adiabatic circuits, the dynamic power dissipation can be reduced 
effectively. This paper presents a gate-length biasing technique for clocked adiabatic 
logic (CAL) circuits using dual-threshold technique to reduce sub-threshold leakage 
dissipations. An improved CAL register file using DTCMOS gate-length biasing is 
also addressed in this paper. All circuits are verified with HSPICE using a NCSU 
45nm technology. 


2 Gate-Length Biasing Technology for Improved CAL Circuits 


The basic CAL gate, buffer / inverter, is shown in Fig. | (a) [9]. It is a dual-rail logic 
with true and complementary NMOS functional blocks (N1, N2) and cross-coupled 
PMOS latch (P1, P2). A sinusoidal power clock (clk) supplies the CAL circuits. CX is 
the auxiliary clock. The clamp transistors (N3 and N4) make the un-driven output 
node grounded. When the clk ramps down towards zero, the energy stored on the 
capacitance is recovered. 

In basic CAL gate, the clamp transistors (N3 and N4) are on non-critical paths, 
since they only used for making the un-driven output node grounded. Therefore, the 
clamp transistors (N3 and N4) can use high-V, transistors. The CAL buffer using gate- 
length biasing technology with dual-threshold techniques is shown in Fig. 1 (b). The 
cross-coupled PMOS latch (P1, P2) uses longer gate-length transistors further to re- 
duce sub-threshold leakage currents. 


clk 


Gate-length biasing 


in OUT a [ Low-/, transistor 


Cx | C High-F, transistor ce 


Fig. 1. (a) Basic CAL buffer, and (b) CAL buffer using gate-length biasing technology with 
dual-threshold CMOS. 
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In the improved CAL circuits, the charge of the auxiliary clock lines can be well 
recycled the power-clock clk because of using the sinusoidal clock signal, thus the 
energy loss of the improved CAL circuits is smaller than the conventional CAL cir- 
cuits, as shown in Fig. 2 [9]. 
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Fig. 2. Auxiliary clock generator and its simulation waveforms. 


The power loss component can be analyzed from the output waveforms of CAL 
shown in Fig. 3. The energy loss occurs when the nodes of the CAL buffer are 
charged or discharged. Based on the power dissipation models of adiabatic circuits, an 
estimation technique for the active leakage dissipations of CAL circuits has been 
proposed in [10]. 
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Fig. 3. CAL buffer chain and its simulation waveforms. 


The energy dissipation per cycle of CAL circuits can be represented as 


crotal = EE diabetic ot EF oh-aduabane a Eveak ? (1) 


where E is the full-adiabatic energy dissipation, E, is the non- 


non—adiabatic 


adiabatic energy dissipation and E,,,, 1s the average leakage energy dissipation per 


adiabatic 
cycle of CAL circuits, respectively. 
The full-adiabatic energy loss per cycle of CAL circuits can be represented as 


E tiabatie = (2°R,C, (2T VC Von > (2) 
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where C, is the additional load capacitance of the CAL buffer, which is introduced for 
the purpose of estimating leakage dissipations, T is the period of the power-clock, Vpp 
is peak-peak voltage of the power clock, and R, is the turn-on resistance of the transis- 
tors (Pl and P2). As shown in (1), the full-adiabatic dissipations of the CAL circuits 
with gate-length biasing is slightly larger than the basic CAL ones without using the 
gate-length biasing technique, since the gate-length of PMOS devices is marginally 
increased. 

As energy recovery through the PMOS transistors, the OUT or OUTb of the CAL 
buffer falls to Vpp - Vip, where V,, is the threshold voltage of PMOS transistors, thus 
resulting in non-adiabatic energy dissipation, which is given by 

E 


non—adiabatic 


=C,Vp (3) 


where V,, is the threshold voltage of PMOS transistors. As shown in (2), the non- 
adiabatic energy dissipation of the DI[CMOS CAL circuits is the same as the basic 
CAL ones without using the DTICMOS technique. 

The average leakage energy dissipation per cycle of CAL circuits can be 
represented as 


Eveakage a Vp! teak /2)T > (4) 


where J; .ax is the average leakage current per cycle of CAL circuits. In DTCMOS 
CAL with gate-length biasing, a higher threshold voltage is assigned to the clamp 
transistors (N3 and N4), and the cross-coupled PMOS latch (P1, P2) use longer gate- 
length transistors. Therefore, the leakage currents of the DTCMOS CAL circuits with 
gate-length biasing are reduced compared with the basic CAL ones without using 
DTCMOS technique and gate-length biasing. 

The total energy dissipation per cycle of CAL circuits can also be represented as 


ki” 
otal = es ae k,C, + k3T ’ (5) 


where k, is m RV 12, ky is Vi , and ky is SVool ess , respectively. The leakage 


power dissipation of CAL circuits can be estimated by measuring total energy dissipa- 
tions (Ficta, FEtota2 and Fyotai3) in three different capacitances (C,, 2C, and 3C,) of 
power-clocks [10]. The leakage power dissipation of CAL circuits can be given by 


Eveak 7 Evotal3 a 3B rotal2 ny SE otal (6) 


3 Improved DTCMOS CAL Register with Gate-Length Biasing 


The improved CAL 32x32 register file using the dual-threshold technology and gate- 
length biasing technology is shown in Fig. 4, and its structure is the same as [11, 12], 
which consists of a storage-cell array, address decoders, read/write word-line drivers, 
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sense amplifiers, read bit-line and write data-line drivers, and a auxiliary clock gene- 
rator that supplies auxiliary clock CX and CXb of the whole circuits. In the improved 
CAL 32x32 register files, the dual-threshold technology and gate-length biasing tech- 
nology have been used to reduce sub-threshold currents. 
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Fig. 4. The structure of improved DTCMOS CAL register using gate-length biasing. 


For comparison, three register files based on the basic improved CAL circuits 
(CAL register), improved CAL circuits using DTCMOS technology (DT-CAL regis- 
ter), and improved CAL circuits using gate-length biasing and DTCMOS technologies 
(GLB-DT-CAL register) are simulated with HSPICE at a NCSU 45nm CMOS tech- 
nology. In the GLB-DT-CAL register, the gate length of the transistors using gate- 
length biasing technology is 8% longer than the original transistors. 

Table 1 show the total energy dissipation per cycle of the three register files. Fig. 5 
show the leakage energy dissipation per cycle calculated according to Eq. 6. At 
20MHz, the GLB-DT-CAL register file can save the leakage power dissipation of 
25.2% and 10.4% compared with DT-CAL register and CAL register, respectively. 
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Table 1. Total energy dissipation per cycle of the 32x32 register files based on the basic 
improved CAL circuits (CAL register), improved CAL circuits using DTCMOS technology 
(DT-CAL register), and improved CAL circuits using gate-length biasing and DTCMOS 
technologies (GLB-DT-CAL register). The peak-to-peak voltage of the power-clock pc is taken 
as 1.1V. 
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Fig. 5. Leakage energy dissipation of the 32x32 register files based on the basic improved CAL 
circuits (CAL register), improved CAL circuits using DTCMOS technology (DT-CAL regis- 
ter), and improved CAL circuits using gate-length biasing and DTCMOS technologies (GLB- 
DT-CAL register). The peak-to-peak voltage of the power-clock pc is 1.1V. 
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4 Conclusion 


This paper has presented a gate-length biasing technique for clocked adiabatic logic 
(CAL) circuits using dual-threshold technique to reduce sub-threshold leakage dissi- 
pations. An improved CAL register file using DTCMOS gate-length biasing is ad- 
dressed in this paper. All circuits are verified with HSPICE using a NCSU 45nm 
technology. The simulations show that the register file with gate-length biasing tech- 
nology can attain large energy savings. The register based on DICMOS CAL circuits 
using gate-length biasing can save leakage power dissipation of 25.2% compared with 
the basic CAL register. 
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Abstract. Ultralow-power design has been a main challenge in modern VLSI 
circuits. This paper explores the near-threshold computing of adiabatic circuits. 
The characteristic of energy dissipations of the ECRL (efficient charge recovery 
logic) circuits is analyzed. It is found that its energy consumption is dependent 
linearly on operating voltage. In the near-threshold region, the ECRL circuits 
obtain considerable energy savings with a little performance penalty. An 8-bit 
ECRL Kogge-Stone adder is realized and simulated using HSPICE at a 45nm 
process with the NCSU PTM model. Simulations show that the power con- 
sumption of the near-threshold ECRL Kogge-Stone adder is reduced about 50% 
compared with the super-threshold one for clock rates ranging from 50MHz to 
1.0 GHz. 


Keywords: Low power, Near-threshold, Efficient charge recovery logic, 
Kogge-Stone adder. 


1 Introduction 


With the rapid progress in semiconductor technology, the density and operation speed 
of CMOS chips have been increasing, so that power consumption has become a criti- 
cal concern in VLSI circuits. The classical circuit techniques to reduce power dissipa- 
tion include transistor sizing and interconnect optimization, gated clock, multiple 
supply voltages, and dynamic controlling of supply voltage [1]. However, with the 
emergence of wireless sensors and biomedical applications that require ultralow ener- 
gy dissipations, the classical circuit techniques are not sufficient to reduce the power 
consumption. 

The power dissipation of a circuit can be significantly reduced by operating at a 
low supply voltage, since it has a square dependence on power supply voltage. It is 
shown that the minimum-energy point occurs in the sub-threshold region of MOS 
transistors for most logic families. Sub-threshold logic has shown significant 
improvement in the term of power consumption, in contrast to operation in strong 
inversion. Therefore, the sub-threshold digital logic design has obtained quit some 
attention over past the decade [2-5]. However, the performance of sub-threshold cir- 
cuits is much lower than super-threshold ones due to the exponential relationship 
between delay and supply voltage. Scaling supply voltage to sub-threshold region 
only suits for ultra-low operation frequencies. Moreover, the robustness of 
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sub-threshold logic circuits must carefully be considered, since their operation relies 
only on leakage currents that are exponentially dependent on Vy and are therefore 
more sensitive to process variation than traditional super-threshold designs [6]. 

Recently, the near-threshold computing is presented. The supply voltage of near- 
threshold circuits is slightly above the threshold voltage of the transistors. This region 
retains much of the energy savings of sub-threshold operation with more favorable 
performance and variability characteristics [7]. 

The design ideas mentioned above aim at power optimization for the conventional 
CMOS logic circuits. In fact, the adiabatic circuit that utilizes AC power supplies to 
recycle the charge of node capacitances is a particularly attractive approach to reduce 
power dissipation [8]. However, the previously proposed adiabatic logic families 
focus mainly on nominal voltage circuits. As the operation frequency of adiabatic 
circuits is determined by AC power supplies, for a given operation frequency, the 
energy dissipation of adiabatic circuits can also be reduced by operating at a low 
supply voltage with a little performance penalty. This paper explores the near- 
threshold computing of the ECRL (efficient charge recovery logic) circuits. An 8-bit 
ECRL Kogge-Stone adder is realized and simulated using HSPICE at 45nm process 
with NCSU PTM model. In the near-threshold region, the ECRL circuits obtain con- 
siderable energy savings with a little performance penalty. 


2 Review of ECRL Circuits 


The basic structure of the ECRL buffer (inverter) and trapezoidal power clocks are 
shown in Fig. | [8]. The buffer consists of cross-coupled PMOS loads (P1 and P2) 
and NMOS pull-down transistors (N1 and N2) that are used for evaluation. 


gy Evaluation Recovery 


tb) 


Fig. 1. ECRL buffer. (a) Schematic and buffers chain, and (b) Its four-phase power clocks. 


The operation of the ECRL buffer can be divided four processes: evaluation, hold, 
recovery, and wait phases. For convenience, we assume IN is high and INb is low at 
the beginning of a cycle. As the supply clock @, rises from zero to Vpp, OUTb 
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remains at a ground level, because JN signal turns on N1, while OUT follows @, 
through P2. When q, reaches Vpp, the outputs hold valid logic level. These values are 
maintained during the hold phase and used as inputs for evaluation of the next stage. 
After the hold phase, @, falls down to a ground level, OUT node returns its energy to 
@ so that the delivered charge is recovered. Thus, the clock @, acts as both a clock 
and power supply. The wait phase is inserted for clock symmetry. In this phase, valid 
inputs are prepared in the previous stage. The ECRL circuits use four-phase power 
clocks to recover the charge delivered by the power clocks. Each clock is followed by 
the next clock with a 90° phase lag. When the previous stage is at the hold phase, the 
next stage must evaluate the logic value in the evaluation phase. 

The general schematic of ECRL is shown in Fig. 2 (a), and it consists of PMOS 
loads and NMOS pull-down networks. The pull-down networks (PDNland PDN2) 
are complementary, and implement the required logic function. The schematics of 
ECRL AND/NAND, OR/NOR, XOR/XNOR, and AND-OR/NAND-OR gates are 
also shown in Fig. 2. 


OUTb 


(d) (©) 


Fig. 2. ECRL buffer. (a) General Schematic, (b) AND/NAND gate, (c) OR/NOR gate, (d) 
XOR/XNOR gate, and (¢) AND-OR/NAND-OR gate. 


3 Near-Threshold Computing of ECRL Circuits 


Voltage scaling represents a proper way for reducing the power dissipations in static 
CMOS and the adiabatic circuits. In this section we analyze the energy consumption 
characteristics of the ECRL circuits firstly, and then explore the relationship of supply 
voltage, energy consumption and performance of the basic ECRL gate. 
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3.1 Energy Consumption Analysis of ECRL Circuits 


There are three main energy dissipations in adiabatic circuits, named as adiabatic 
energy dissipation, non-adiabatic energy dissipation and leakage power dissipation. 
For convenience, we will assume a symmetrical trapezoidal waveform in the follow- 
ing considerations, and thus the four phases (evaluation, hold, recovery, wait) are of 
equal duration, which is a quarter of a period T=1/f. 

The energy loss occurs when the nodes of the ECRL circuits are charged or dis- 
charged, which is regarded as the main part in the total energy dissipation. The full- 
adiabatic energy dissipation per cycle of ECRL circuits can be represented as 


E adiabatic = 8 fRCiV5p ? (1) 


where C, is the load capacitance of the ECRL circuits, fis the frequency of the pow- 
er-clock, and R is the turn-on resistance of the PMOS transistors. The energy dissipa- 
tion per cycle of the static CMOS circuits is 


E siatic = CLV oo - (2) 
Based on the Eq. (1) and Eq. (2), the cross-over frequency f, can be derived as 
Se =1/8RC, . (3) 


The cross-over frequency f. will rise with decreasing capacitance C, and R. This 
means that adiabatic computing would more save energy for a given frequency with 
decreasing capacitance C;, and R. 

During the evaluation and recovery phases, as the power clock falls below the IVipl, 
the PMOS transistor is turned off, so that the path between the power clock and the 
output node is disconnected, thus resulting in non-adiabatic energy dissipation. The 
amount of the energy loss is given by 


2 
© oii -adiabadie = Cc, IVip | > (4) 


where IV,,| is the threshold voltage of PMOS transistors. This energy dissipation does 
not depend on the frequency, and it represents the lower bound energy of ECRL 
circuits. 

The leakage power dissipation of ECRL circuits can be expressed as 


Ds(t) = Teak Opp (1) . (5) 


For convenience, we use the average leakage current J,,,, to replace i),,,(t). The aver- 
age leakage energy dissipation per cycle of ECRL circuits is 


: 1 
Fvcak = ie p,(t)dt = fo ieak (t)Vpp (t)dt = fe. DieakY pp (t)dt = 5 Vol tear! : (6) 


where T is the period of the power-clock, Vpp is the peak-peak value of the trapezoid- 
al power clock. The total energy dissipation per cycle of the ECRL buffer can be 
expressed as 
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1 
2172 2 
Evotal = E adiabatic te Enon -adiabatic + Freak = 8 {RC Vdp + C. Vip | +5 Mop! eax! : (7) 


According to Eq. (7), the total energy dissipation can be reduced, when the operation 
supply voltage is lowered. 
When a MOS transistor operates in the linear region, the on-resistance R is 


R=1/(,C = Ves =o) P (8) 


R is inversely proportional to the overdrive voltage. Therefore, the total energy losses 
scale approximately linearly with the supply voltage compared to the proportionality 
to the square of the supply voltage for static CMOS. 


3.2 Near-Threshold Computing of ECRL Circuits 


To estimate the power dissipation of the ECRL circuits in different supply voltages, 
the basic ECRL gates are simulated using HSPICE at a 45nm CMOS process using 
the NCSU PTM BISM4 model. The threshold voltage of PMOS and NMOS transis- 
tors is -0.423V and 0.471V, respectively. The peak-to-peak voltage of the sinusoidal 
power clocks varies from 1V to 0.3V. Fig. 4 shows the energy loss curve of the ECRL 
XOR gate in 50MHz, 100MHz, 5OOMHz, and 1GHz. The size of the MOS transistors 
using in the XOR gate is W/L=40A/2A and A=25nm. 

The operating regions of ECRL circuits can be divided three parts: Subthreshold 
Region, Near-threshold Region and Super-threshold Region. From the energy curve 
shown in Fig. 4, we can observe that the energy dissipation of the ECRL XOR gate 
depend linearly on the supply voltage. For a certain operating frequency, there exists a 
minimum supply voltage, at which the circuits can operate correctly. Though adiabat- 
ic circuits can operate at the subthreshold region, and obtain considerable energy 
reduction, the maximum operation frequency is lower than the other two regions. 
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Fig. 4. Energy consumption of ECRL XOR gates in different supply voltage operating regions 
and frequencies. 
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When the ECRL XOR gate operating at 0.4V peak-peak voltage of the power-clock, 
the maximum frequency only reaches 1OOMHz. Moreover, subthreshold circuits are 
extremely sensitive to variations in supply, temperature and processing factors. 

In near-threshold region, the operation frequency is wider than the subthreshold 
one. For the peak-peak voltage at 0.6V, the maximum operation frequency of the 
ECRL XOR gate is up to 1GHz. And it can save about 50% of energy dissipation 
compare to nominal operation voltage (1.0V). In the near-threshold region, the MOS 
transistors usually are in strong or moderation inversion, the robustness of near- 
threshold circuits are improved greatly than that of subthreshold circuits. These bene- 
fits are much more attractive for a wide variety of new applications. 


4 Near-Threshold ECRL Kogge-Stone Adder 


The Kogge-Stone adder is a carry-lookahead adder with parallel prefix form, and is 
widely used in the industry for high performance arithmetic circuits [9]. In this sec- 
tion an 8-bit Kogge-Stone adder is realized using ECRL gates to verified near- 
threshold computing ideas. The schematic diagram of the 8-bit ECRL Kogge-Stone 
adder is shown in Fig. 5. It consists of propagate and generate signal generation cir- 
cuits, carry-lookahead tree using dot operation and sum generation circuits. 

The propagate and generate functions have been defined as 


P,=A,+B, and G,=A,-B,. (9) 


The carry-lookahead tree using dot operation generates the complete set of carry bits. 
The dot operator is defined by 


(G, P)e(G’, P’) =(G+PG, PP’). (10) 


However, the designed adder does not have an input carry, Cin, since it is to be used as 
a standalone adder, and is not a part of a bigger adder. Therefore, C;, of the adder is 
assumed to be 0, and then the sum generation function can be expressed as 


5, =f, OC, 4 =F, OG, 1% (11) 


In Fig. 5, the triangles are buffers, which are used for maintaining a pipeline. These 
buffers propagate the correct logic values for addition like latch in the general pipe- 
line structure. To obtain the first result of the addition, a few cycles of latency are 
needed. The latency is only 1.25 cycles for the 8-bit adder. 

The 8-bit Kogge-Stone adder is simulated using HSPICE at the 45nm CMOS 
process using the NCSU PTM model. The basic gates using in the adder are shown in 
Fig. 1 and Fig. 2. The device size of PMOS transistors of all the gates is taken with 
W/L = 402/22, and X = 45nm. The size of all NMOS transistors is taken with 40A/2A, 
except for the AND gate’s N3 and N4, OR gate’s Nland N2, AND-OR gate’s N1 
with W/L = 201/21. The energy consumption of the adder operating at different supply 
voltages is evaluated. Fig. 7 shows the energy loss curve of the adder operating at 
0.6V and 1.0V peak-peak voltage of sinusoidal power-clocks. 
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Fig. 5. Schematic diagram for 8-bit ECRL Kogge-Stone adder. 
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Fig. 6. Energy consumption of 8-bit Kogge-stone adder. 


The simulation results show that the proposed near-threshold ECRL 8-bit Kogge- 
Stone adder has considerable power savings. Compared to the conventional ECRL 8- 
bit Kogge-Stone adder operating at the nominal voltage (1.0V), the near-threshold 
ECRL 8-bit Kogge-stone one operating at the peak-peak voltage of 0.6V attains about 
50% energy savings for clock rates ranging from 50MHz to 1.0 GHz. 


5 Conclusion 


In this paper, we have proposed a near-threshold computing scheme of ECRL circuits. 
We analyzed the characteristic of the power consumption of ECRL circuits, and found 
that power dissipation scales down linearly with the supply voltage. For the ECRL 
circuits, the near-threshold operation region is the optimal one compare to the other 
two regions. The simulation results showed that the near-threshold ECRL adder con- 
sumed about 50% of the dissipated energy of the conventional ECRL one operating at 
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normal source voltage. So the near-threshold computing is an attractive method for 
ultra-low-power applications. 
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Abstract. Dual-threshold CMOS (DTCMOS) has been proven as an effective 
way to reduce sub-threshold leakage currents both in the active and standby 
modes, while the high performance of the circuits is still maintained. P-type 
logic circuits can reduce the gate leakage dissipations significantly in nanome- 
ter CMOS processes. This paper presents a dual-threshold CMOS scheme for 
p-type clocked adiabatic logic (CAL) circuits to reduce leakage power dissipa- 
tions. An ISCAS 74182 benchmark circuit from the ISCAS 74X-series is veri- 
fied using DTCMOS p-type adiabatic circuits. All circuits are simulated using 
the 65nm CMOS process with gate oxide materials by HSPICE simulation tool. 
The results show that ISCAS 74182 benchmark circuit based on P-type adiabat- 
ic circuits with the dual-threshold voltage technique can achieve large 
energy savings, since both sub-threshold and gate leakage consumptions are 
reduced effetely. 


Keywords: Logic circuits, DICMOS, P-type logic, Clocked adiabatic logic, 
Leakage reduction. 


1 Introduction 


As technology scaling trends continue in future generations, leakage currents are not 
neglectable any more. Even today, leakage current consumption has become the ma- 
jor contributor to power consumption as technology drops below the 65nm feature 
size. Leakage currents are mainly from the following three sources: sub-threshold 
leakage current due to very low threshold voltage (V;,), gate leakage current due to 
very thin gate oxide (T7,,), and band-to-band tunneling leakage current due to heavily- 
doped halo [1]. The subthreshold leakage currents are exponentially dependent on 
device threshold voltage, while the gate leakage currents increase exponentially with 
reducing gate oxide thickness [2, 3]. 

The DTCMOS (Dual-threshold CMOS) has been proven as an effective way to re- 
duce subthreshold leakage currents both in the active mode and the standby mode. The 
DTCMOS strategy involves using low threshold voltage (V,,) transistors for the gates 
on the critical paths and high threshold voltage (V,;,) transistors for the gates in the non- 
critical paths [4, 5]. High-V,,, devices can be used to reduce leakage currents while low- 
Vi, devices can be used to maintain high performances [6]. The DTCMOS is a very 
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attractive technique for reducing subthreshold leakage currents with no extra area by 
simply using high V,, transistors. 

P-type logic circuits that consist mostly of PMOS transistors have been proposed to 
reduce gate leakage dissipations [3]. The technique is based on the fact that the gate 
leakage through SiO, for the PMOS transistors is an order of magnitude lower than 
for the NMOS ones [3]. The reason for this difference is that electron tunneling from 
conduction band (ECB) is the dominant component of gate leakages for the NMOS, 
whereas it is the hole tunneling from valence band (HVB) for the PMOS. As the bar- 
rier height for HVB (4.5ev) is significantly larger than for ECB (3.lev), PMOS tran- 
sistors have much lower gate leakages than NMOS [3, 7]. 

Adiabatic logic is a promising low power circuit to reduce the energy consumption 
in digital by using a constant current to efficiently charge a capacitor. Therefore, a 
clocked power supply (power clock) is used, that consists of four states. Only during 
one of the four states the whole supply voltage Vpp drops across the gate. Hence a 
reduction of the leakage currents is implemented explicitly by the power clock in 
adiabatic logic circuits [7]. 

This paper explores P-type CAL (Clocked Adiabatic Logic) circuits [8] with dual- 
threshold voltage technique for reducing leakage currents. An ISCAS 74182 bench- 
mark circuit from the ISCAS 74X-series set [9] is verified using the DTCMOS P- 
Type CAL circuits. Both sub-threshold and gate leakage consumptions are reduced 
effetely. 


2 Dual-Threshold Technology for p-Type CAL 


N-type logic circuits, such as CPL (complementary pass-transistor logic) and DVSL 
(differential cascode voltage switch logic) circuits, have been addressed extensively in 
the past years, which use more NMOS transistors than PMOS ones, since NMOS 
transistors has better conduction, smaller area and node capacitance than PMOS ones. 
As MOS transistors get smaller and the gate oxide gets thinner, PMOS transistors 
have lower gate leakage current than NMOS ones. Based on this fact, the circuits that 
consist mostly of PMOS transistors, which are named as P-type logic circuits, can 
reduce gate leakage power dissipations compared with N-type logic circuits [8]. 

Basic P-type CAL buffer/inverter has been reported in [8], as shown in Fig. | (a). 
Cascaded P-type CAL gates are driven by a single-phase power clock (c/k), as shown 
in Fig. 1(b). The structure and operation of the P-type CAL circuit are complementary 
to the N-type CAL one [10, 11]. It is realized by using NMOS loads and PMOS pull- 
up transistors. Its simulation waveforms are shown in Fig. 1 (c). 

The DTCMOS has been widely explored in conventional CMOS circuits. It uses 
low threshold voltage (V,,) transistors in the critical paths and high threshold voltage 
(V,,) transistors in the non-critical paths. Similarly, the DTCMOS can be also used for 
the adiabatic circuits [12]. A dual-threshold scheme for the P-Type CAL buffer is 
illustrated in Fig. 2. The clamp transistors (P3 and P4) use high V7 transistors to 
reduce leakage power dissipations, while the other transistors of the P-type CAL cir- 
cuits use the low V7y transistors to retain circuit performances. 
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Fig. 1. Basic P-Type CAL buffer. (a) schematic and symbol, (b) buffer chain, and (c) simula- 
tion waveforms. 


Based on the power dissipation models of adiabatic circuits, active leakage dissipa- 
tions can be estimated by testing total leakage dissipations using SPICE simulations. 
The estimation technique for basic P-CAL circuits has been reported in [8], which can 
also be used for estimating active leakage dissipations of the DTCMOS p-type CAL 
circuits. 
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Fig. 2. P-Type CAL buffer with dual-threshold technology. 


The energy dissipation per cycle of CAL circuits includes adiabatic energy dissipa- 
tion (Egdiabatic)) NON-adiabatic energy loss (Enon-adiabatic) aNd leakage dissipation (Ezeax). 
The total energy dissipation per cycle of the CAL buffer can be represented as 
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where C, is additional load capacitance of the CAL buffer, which is introduced for the 
purpose of estimating leakage dissipations, T is period of the power-clock, Ry is turn- 
on resistance of the transistors (N1 and N2), Vzy is threshold voltage of NMOS tran- 
sistors, and J/,,, is average leakage current per cycle of P-type CAL circuits. 

According to Eq. (1), the total energy dissipation per cycle of CAL circuits can al- 
so be represented as 


Od 
Be ae s +k,C, +kT , (2) 


where k, is RV t 2s ky is Ven /2 , and k3 is Vopl tea, /2. The leakage power 
dissipation of P-type CAL circuits can be estimated by measuring total energy dissi- 
pations (Ficta, Ftotaz and Fjotai3) in three different capacitances (C,, 2C, and 3C_) of 
power-clocks. The leakage power dissipation of CAL circuits can be given by 


Eveak = Erotal3 = SE rotal2 3Erotal ’ (3) 


3 Combinational Logic Circuits Using DTCMOS P-Type CAL 


An ISCAS 74182 benchmark circuit from the ISCAS 74X-series is shown in Fig. 3 
[9]. The 74182 benchmark circuit based on P-type CAL is verified using the 
dual-threshold voltage technique and single-threshold voltage. All the circuits use P- 
type CAL circuits, which need a single-phase power clock (clk) and an auxiliary clock 
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generator that supplies auxiliary clock CX and CXb of the whole circuits. The aux- 
iliary clock CX and CXb control the cascade circuit alternately. 

We can use dual-threshold technology in the P-Type CAL ISCAS 74182 bench- 
mark circuit, and their gates are realized by using required PMOS logic blocks to 
replace the input PMOS transistors (P1 and P2) of the P-type CAL buffer shown in 
Fig. 2. The simulated waveforms of the ISCAS 74182 benchmark circuit based on P- 
Type CAL are shown in Fig. 4. Fig. 5 and Fig. 6 show the total and the leakage ener- 
gy dissipations per cycle of ISCAS 74182 benchmark circuits based on P-Type CAL 
with the dual-threshold circuits and single-threshold circuits at the 65nm CMOS tech- 
nology, respectively. Compared with P-Type CAL circuits with the single-threshold, 
both total and leakage energy dissipations of the dual-threshold P-Type CAL circuits 
can be reduced effectively. 
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Fig. 3. The ISCAS 74182 benchmark circuit. (a) symbol, and (b) gate-level schematic. 


Fig. 7 shows the leakage power consumption saving rate. Compared with the sin- 
gle-threshold implementation, the leakage power consumption saving of the ISCAS 
74182 benchmark circuit based on the dual-threshold P-Type CAL circuits is about 
48% at 1OOMHz. 
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Fig. 4. The simulation waveforms of the ISCAS74182 benchmark circuit based on P-Type 
CAL. 
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Fig. 5. Es and Ed are the total energy dissipations of the ISACS 74182 benchmark circuits 
based on single-threshold and dual-threshold P-Type CAL circuits at the 65nm CMOS 
technology, respectively.The peak-to-peak voltage of the power-clock pc is 1.1V. 
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Fig. 6. LEs and LEd are the leakage energy dissipations of the ISACS 74182 benchmark 
circuits based on single-threshold and dual-threshold P-Type CAL circuits at the 65nm CMOS 
technology, respectively.The peak-to-peak voltage of the power-clock pc is 1.1V. 
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Fig. 7. Leakage dissipation saving rate of ISACS 74182 circuits based on dual-threshold 


P-Type CAL circuits compared with single-threshold P-Type CAL ones at the 65nm CMOS 
technology. 


232 W. Zhang, L. Su, and Y. Wu 


4 Conclusion 


This paper proposes a dual-threshold technology for P-type clocked adiabatic logic 
(CAL) circuits to reduce leakage power. The ISCAS 74182 benchmark circuits based 
on the P-Type CAL circuits are used to testify this technology. Compared with the 
single-threshold P-Type CAL, the total and leakage energy dissipations of ISCAS 
74182 benchmark circuit based on the dual-threshold P-Type CAL circuits is signifi- 
cantly reduced, since both sub-threshold and gate leakage consumption are reduced 
effetely. 
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Abstract. The cascade H-bridge inverters are widely utilized in high voltage 
and large capacity occasions. Furthermore, it is suitable for the medium voltage 
level active power filter (APF) field. Considering the high speed response is of 
great importance for APF application, the large capacity APF generally applies 
low level switching frequency which will affect the response speed. So it is 
very important to design appropriate modulation method for large capacity 
APF. Phase shift PWM (PS-PWM) method can be regarded as a good solution 
for inverter with low level switching frequency. Further study on PS-PWM is 
made in this paper and an improved PS-PWM method is presented. With the 
novel modulation method, high response speed of APF can be guaranteed with 
relative low switching frequency. The algorism is simple to be implemented. 
A three-stage cascade APF (CS-APF) simulation model is built and simulated, 
and the results show that the novel modulation method is an effective and usa- 
ble solution to promote the response of CS-APF. 


Keywords: CS-APF, modulation, PS-PWM, time delay. 


1 Introduction 


Harmonics exist in the network and EMI brought by harmonics is becoming serious 
with an increasing implementation of nonlinear power-electronic devices. Nowadays, 
passive filter (PF) is the main filter device used in the network, especially in the high- 
voltage and medium-voltage level. Although significant progress has been made in 
active power filter (APF) technology which has been used in some low-voltage (e.g. 
380V) and low-capacity levels, only few are put into practical operation. Constrained 
by voltage level of power-electronic devices, less APFs exist for successful medium- 
voltage applications (e.g. 1|OkV), where series and parallel of several single APFs are 
usually utilized to enlarge the APFs’ capacity and voltage [1, 2]. This undoubtedly 
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requires expensive and large volume transformers. Another solution is to select hybrid 
topology, where the PF devices stress most of the voltage from the grid [2, 3]. 

In cascade H-bridge topology, H-bridge units are in series to achieve high voltage 
levels which makes it possible to connection the inverter directly to the grid without 
transformers. At present, static synchronous compensator (STATCOM) with cascade 
topology have already been applied for commercial application [4, 5]. Furthermore, 
this topology can be directly used in APF field and the relative studies have been 
made by some researchers [6, 7]. Cascade APF is used to enlarge the capacity and 
voltage. However, switching frequency of large-capacity power-electronic devices is 
usually kept at low levels. However, the APF has to respond fast enough in order to 
attain a better harmonic compensation performance. High-frequency devices such as 
IGBT are selected in this application. In practical application, switching frequency of 
large-capacity IGBT (e.g. 1200A/1700V) is only several kHz which is up to the cool- 
ing rate. Meanwhile, frequency of the switching devices is required as lower as possi- 
ble in order to guarantee the stability and efficiency of the whole system. 

Therefore, PWM modulation strategy with fast response and low switching fre- 
quency is of great importance to cascade APF. Low-switching-frequency cascade 
APF is studied in the paper [8]; A hybrid modulation method is presented in paper 
[9]. Low switching frequency is realized in two-level cascade topology but the equiv- 
alent switching frequency is half of the carrier in phase-shifting PWM method. Thus, 
this method is appropriate only in the case of high switching frequency: SVPWM is 
studied in paper [10, 11]. However, there are many redundant vectors with an increase 
of level numbers, which results in complexity of the design. 

This paper presents an improved PS-PWM method, which keeps high equivalent 
switching frequency and fast response speed with real low switching frequency. This 
paper designs a three-stage cascade APF simulation system, and introduces the con- 
trol principles. And the new PWM method is verified by the simulation at the end of 
the paper. 


2 Structure of Cascade APF 


Medium-voltage cascade APF topology is shown in Figurel. Without transformers 
the cascade APF directly connects to the grid. H-bridge cascade can be either delta 
connection or star connection. The star connection is selected here. N is the cells 
number in per phase leg of cascade APF, and it is determined by the grid voltage 
level, together with the H-bridge DC-link voltage level. Capacitor on the DC-link side 
is float and a parallel discharge circuit is added. 


2.1 Control of the Cascade APF 


As shown in Fig2 to Fig4, the control system consists of internal current loop and 
external voltage loop. The harmonic detection method is based on the instantaneous 
reactive power theory (IRP) as shown in Fig2. 
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Fig. 1. Topology of CS-APF 
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The current control loop consists of load current feed-forward control and grid current 
feedback control. Fig3 shows the structure of current loop (The three phase system is 
replaced by single phase for simple). A proportional controller is adopted to achieve 
the current tracking control. To get better performance and suppress the effects of the 
line voltage, a grid voltage feed-forward compensation regulator is adopted, then the 
proportional controller equals as follows: 


een ae (2) 
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Fig. 3. Current control scheme 


For the DC-link voltage loop, a PI controller is designed to keep the total DC-link 
voltage of one phase leg equal to others. However, unbalanced voltage problem may 
exist in CS-APF, and a control method is introduced in papers [6, 7]. Besides the 
reference current generated by the PI controller maybe not suitable for star connec- 
tion, so a discharge circuit is added to the DC-link capacitor. When the voltage is 


higher than the setting level ave this circuit can discharge the capacitor. And it is 


important that the discharging voltage level has to be higher than the DC-link working 
level. A proportional controller is used to control the discharging PWM duty. 
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Fig. 4. DC-link voltage control scheme 


2.2 New PWM Method for Cascade APF 


For CS-APF with PS-PWM modulation method, the phase of each cell carrier wave 
should be shift by g = WA n is the cascade cell number. Besides the left leg and right 
n 


leg modulation wave phase is opposite (Fig5). Then the equivalent switching frequen- 
cy of the output voltage is 2n times of the switching frequency, and the harmonic 
component is very low, as shown in Fig5: 
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Fig. 5. PS-PWM principle 
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Focus on the time delay. As for common two level voltage inverter with SPWM, 
time delay brought by modulation is t=0.5T,. 

Ts is the sampling frequency (equal to switching frequency). The PS-PWM method 
will also bring about time delay as a result of PWM duty sampling. This delay will 
reduce the current loop phase margin. If the gain of current loop is not reduced, the 
current loop may resonate. When the switching frequency is high, carrier modulation 
will produce less delay component compared to the total delay in the system, and the 
modulation delay has no significant effect. The carrier phase-shifting is to improve 
the equivalent switching frequency and the other way round, to reduce the actual 
switching frequency. When the actual switching frequency is reduced, measures must 
be taken into consideration to guarantee the fast response of the inverter. 

Response speed can be improved by increasing the sampling frequency, as is 
shown in Fig6. 


Fig. 6. Repeated sampling for PS-PWM 


Update the duty value in the carrier peak point or valley point of each cell won’t 
affect the actual switching frequency and will improve the response speed of inverter 
simultaneously. Set the sampling cycle to 7,, and then (cascade cell number is n) the 
switching cycle is 2nT,, the equivalent duty cycle after sampling isD(t) . According to 
the V-S (voltage and second balance characters) theory: 


eo T, 37, i. 
gs ea aaah Pi i ] (3) 


==) (pr-F4)) 


N =I 
If the duty cycle is linear during one switch cycle as is shown in Fig6(b), then we can 


get the approximate formula as follows 


‘ nT, 
D(t) =~ D(t- a, (4) 


s bia 
= Dit— switch 
( rae ) 
According to the duty cycle formula with sampling frequency up to 2 times of 


switching frequency, PS-PWM modulation delay is quarter of switching cycle, and it 
will not decrease as the n increase. So with this modulation method and taking time 
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delay into account, the equivalent switching frequency is not equal to the real 2n 
times switching frequency. 

The time delay always exits with digital control and carrier wave modulation. Low 
level switching frequency will bring in long time delay. Even if the above modulation 
strategy was adopted, the time delay of the modulation could be considerable, espe- 
cially in the occasion of very low level switching frequency. 

As Fig7 shows, with PS-PWM strategy, when one stage PWM duty cycle gets up- 
dated, for example D; at the time fj, the D; keeps constant between time fg to ft,. It 
shows that when one cell’s duty cycle is updated, other cells’ duty cycles will be kept 
constant for a predictable time, so these constant duty cycles can be compensated 
during the time. Then the response speed of CS-APF can be increased, in another 
word, the modulation time delay can be reduced. 

The compensation D is derived according to the theory of V-S(volt and second 
balance characters). Taking three-stage cascade modulation for example 


Fig. 7. Duty compensation for PS-PWM 


As Fig7 shows, suppose the reference duty value D ref is linear during one switch- 
ing cycle Tyyircn. D3 will get updated at time tg; D; will be kept constant from f to ¢;, 
and D, will be kept constant from fg to t). Taking the other two stages’ duty cycle (D;, 
D>) errors into account, the compensation for each stage can be set as follows: 


2 1 
BD a Ps) BD De BD (5) 
Then define the new duty cycle value p, 


2D,+D, (6) 
3 


And the other two duty cycles can be derived as follows 


D, = D, + ADy, + AD,, =2D, 


Piaop, coe ea 
D,=2b, zDD. 
D = 2p, —2: D, 
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Time delay introduced by carrier modulation will be significantly reduced after com- 
pensation. The improved modulation method and the former one are compared 
through simulation in the following. 

The bottom two curves of Fig8(a) show the result of tracking the 500Hz-sinusoidal 
wave with the two above modulation methods. For clear expression, both signals 
(output voltage waveforms) shown in the Figure are filtered by filter with same cut- 
off frequency (1kHz). The red one uses the improved modulation method, and the 
other uses the usual PS-PWM. It’s obvious that the improved method can shorten the 
time delay a lot, especially in the case of tracking 1kHz-sinusoidal wave (Fig8(a)top 
ones). 

Fig8(b) shows the phase characteristic of three kinds of modulation method after 
AC sweep. The upper curve is simulated with 12kHz switching frequency and 12kHz 
sampling frequency, the middle one with the improved PS-PWM method and the 
lower one with the former PS-PWM method. 


(a) (b) 
Fig. 8. Comparison of PS-PWM and new PS-PWM 


At last, we can derive the common duty compensation formula as follows 


n+i-k 


D,=D,+5'1 (D, -D)I+ Y (2, -D)I (8) 
i=l n 


n i=k+l 
In the above formula, 7 is the total cascade stage number, D, is the k stage PWM duty 
cycle value, and D, is the new value. 


3 Simulation Study 


A three-stage CS-APF simulation model has been accomplished in the environment of 
PSIM and the simulation parameters are as follows: 


Network voltage: 3kV; Network equivalent inductance: 0.02mH; Load: three phase 
rectifier, resistance: 20Qat start, reduced to 5.72Q at 0.2 second. Cascade stage num- 
ber:3, DC-link voltage of each cell: 1200V; DC-link capacitor: 5000uF; Inductance of 
CS-APF: 0.6mH; Switching frequency:2kHz; Sampling frequency: 12kHz; Discharge 
resistance: 100Q; Discharge switching frequency: 1kHz. 
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Fig. 9. General PS-PWM method 


Fig9 shows static response of the simulation model with general PS-PWM strategy. 
From top to the bottom, Fig9 shows grid current, load current, CS-APF current and 
the reference current for the current closed-loop control. It’s obvious that resonance 
exists in the CS-APF current because time-delay during the control and modulation. 
Only by reducing the gain in the current control loop with the results of lower re- 
sponse, the resonance could be restrained. However, if the modulation strategy is 
replaced by the novel method, the result will be improved (Fig 10). With the novel 
PWM strategy, the resonance has been restrained, and it also increases the response 
speed with lower peak of leakage harmonics. The current ripple is small enough for 
this application and is equivalent to the one with switching frequency of 12kHz. 
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Fig. 10. Novle PS-PWM method 


Figl1 shows the dynamic simulation results of the CS-APF with novel PS-PWM 
method. During the process of the step increase of the load, the CS-APF can also 
response immediately, and the harmonics on the grid current is kept at low level. For 
the real power is not separated completely in the short process due to the LPF in the 
harmonic detection algorithm, the DC-link voltages go lower. But the DC-link capaci- 
tors get charged soon and the voltages are kept at the reference level, which proves 
the effectiveness of the DC-link-voltage control method. 
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Fig. 11. Dynamic process of CS-APF 


4 Conclusion 


The PS-PWM method can improve the equivalent switching frequency for cascade 
inverter, but the time delay brought by PS-PWM is still considerable. The time delay 
almost keeps constant, when the stage number increases. This paper presents a novel 
modulation method for cascade active power filter. With the advantages of high re- 
sponse speed and relative high equivalent switch frequency, this modulation method 
fits for cascade inverter application. At last a simulation model of three-stage CS-APF 
is implemented, and the simulation results proved the above method is effective. 
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Abstract. Edge detection plays an important role in many fields. Many edge 
detection operators such as Roberts edge operator, Sobel edge operator and 
Prewitt edge operator have been proposed to solve edge detection problems. 
Each of them has its own characteristic. But it is difficult to select a suitable 
operator according to a specific problem. In this paper, a novel method for edge 
detection is presented. Firstly, traditional edge detection operators are used in a 
gray image to obtain the results which are viewed as the feature vectors of the 
image data. Then kernel-based fuzzy c-means clustering algorithm is applied in 
these feature vectors to detect out the edge points adaptively. Experimental re- 
sults show the proposed method's efficiency. 


Keywords: Edge detection operator, Fuzzy clustering, Roberts operator, Sobel 
operator, Prewitt operator. 


1 Introduction 


Edge detection plays an important role in many fields such as image analysis, computer 
vision and automated driving et al. [1] Edge points are pixels at which abrupt gray-level 
changes occur because of changes in surface orientation, depth or physical properties of 
materials. The aim of edge detection is providing a meaningful description of identi- 
fying and locating sharp discontinuities in an image. 

There are many edge detection operators available. Commonly used operators 
for edge detection are mainly [2]: Roberts edge operator, Sobel edge operator and 
Prewitt edge operator. Each of them is designed to be sensitive to certain types of 
edges. In addition, some people put forward new edge detection algorithms. Tuba Sirin 
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et al. presents an edge detection method which is based on the use of the clustering 
algorithms and a gray scale edge detector [3]. Wenshuo Gao et al. [4] proposes a me- 
thod which combines Sobel edge detection operator and soft-threshold wavelet de- 
noising to do edge detection on images which include white Gaussian noises. All these 
methods mentioned above are used only single edge detection operator. 

In this paper, an integration method of edge detection operator with kernel-based 
fuzzy c-means clustering algorithm (KFCM) is presented. The rest of this paper is 
organized as in the following; commonly used edge detection operators are discussed in 
the next section. In section 3 KFCM used in the proposed method is discussed. Section 
4 describes the proposed algorithm. The comparative experimental results are dis- 
cussed in Section 5. Conclusions are given in the last section. 


2 Edge Detection Operators 


2.1 Roberts Edge Operator 


The Roberts operator is very quick to compute 2-D spatial gradient measurement on an 
image. It highlights regions of high spatial frequency which often correspond to edges. 
In its most common usage, the input to the operator is a grayscale image, as is the 
output. Pixel values at each point in the output represent the estimated absolute mag- 
nitude of the spatial gradient of the input image at that point. Only four input pixels 
need to be examined to determine the value of each output pixel, and only subtractions 
and additions are used in the calculation. In addition there are no parameters to set. Its 
main disadvantages are that since it uses such a small kernel, it is very sensitive to 
noise. 


2.2 Sobel Edge Operator 


The Sobel operator is a discrete differentiation operator, computing an approximation 
of the gradient of the image intensity function to find the edge. If one takes the 
derivative of the intensity value across the image and find points where the derivative is 
maximum, the edge could be found. The Sobel operator is based on convolving 
the image with a small, separable, and integer valued filter in horizontal and vertical 
direction and is therefore relatively inexpensive in terms of computations. The result 
of the Sobel operator is either the corresponding gradient vector or the norm of 
this vector which is relatively crude, in particular for high frequency variations in the 
image. 


2.3 Prewitt Edge Operator 


The Prewitt edge detector is an appropriate way to estimate the magnitude and orien- 
tation of an edge. It calculates the maximum response of a set of convolution 
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kernels to find the local edge orientation for each pixel. The set of kernels is limited 
to eight possible orientations. However experience shows that the Prewitt edge 
detector has a major drawback of being very sensitive to noise. In addition, the size of 
the kernel filter and coefficients are fixed and sometimes it cannot be adapted to a given 
image. 

Thus, it is necessary to provide a robust edge detection algorithm to help distinguish 
valid image contents from visual artifacts introduced by noise. 


3 Kernel-Based Fuzzy c-Means Clustering Algorithm (KFCM) 


The fuzzy c-means (FCM) algorithm was introduced by J.C.Bezdek [5]. Given 
eA is tpi x, } where x, in R’, the idea of FCM is using the weights that mi- 
nimize the total weighted mean-square error: 


J,U)=S ou 


i=l k=l 


x, -v, |, (1) 


Here c is the number of clusters, n is the number of data points, and u, is the 


membership of x, in class i takes value in the interval [0,1] such that 


7 =] (2) 


forall k. 

Dao-Qiang Zhang et al. [6] introduced kernel methods to FCM and proposed ker- 
nel-based fuzzy c-means clustering (KFCM) algorithm which is a robust clustering 
approach. 

As we know, a pattern in the original input data space X can be mapped into the 
higher dimensional feature space F through the nonlinear mapping function ® . 


®:X =(x,,x,,.. x,) 9 ®(X) = (@(x,),..., (xy )) (3) 


The objective function of KFCM is constructed as follows: 


J, Uv) = Yuk (4, - OP (4) 


i=l k=l 


Scalar product calculation in input space is transformed into kernel function calculation 
by nonlinear mapping 


(x, °x;) > (®(;)-®(x))) = K(y,4)) = K; (5) 
Thus, we have 


|(x,)- BO, | =K Oy. 4) + KV) -2K (GY) (6) 
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To minimize J,, under the constraint of Eq.(2), the alternate iterative equations can be 
obtained as follows: 


HK (%,.%,) + K(,.V,)—2K@,.v, er 


: (7) 
Yi IMK (x, 5%) + K(v,.¥,)-2K (xv, er 


ik 


a u"K (x, Vid, 
y= (8) 
Demin K (a, 5¥,) 
k=1 


Here we use Gaussian kernel. It can be defined as follows: 


K(x,,v,)=exp(- ) (9) 


2: 

| -vi 
2 
20 


KFCM algorithm can be summarized in the following steps: 
Step 1: Fix ¢ and select parameters, ¢,,,,m>1 and €>0 for some positive 
constant. 
Step 2: Initialize the memberships u’, . 
Step 3: For t=1, 2,..., fax , do: 
(a) update all v, with Eq.(8); 
(b) update all memberships u;, with Eq.(7); 
(c) if max, , be -u'" | S$ €, stop; 


End. 


4 Proposed Edge Detection Method 


A two-stage edge detection operator integration method is applied on the images. 
Novelty in our study is the use of KFCM to combine three edge detection operators. 
The stages of our algorithm as follow: 


Stage 1: Gray scale edge detection by using Roberts, Sobel and Prewitt operators. 
Stage 2: Clustering of the results of the edge detection operator by using KFCM. 


Firstly, a pixel point in a gray image is regarded as a data sample, and its gray values 
which are processed by Robert operator, Sobel operator and Prewitt operator make up 
of the feature vectors of this data sample. In this way a data set with three-dimensional 
features can be obtained. Then the KFCM clustering algorithm is used in this data set, 
the edge points can be detected out adaptively. 
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5 Experimental Results 


Edge detections of traditional edge detection operators and the proposed method are 
performed on the gear image. Fig. | is the gear image and Fig. 2 is edge detection 
results of Fig. 1. Fig. 3 is the gear image with Gaussian noises and Fig. 4 is edge de- 
tection results of Fig. 3. As is shown in the experimental results, the proposed method 
yields the best results. 


Fig. 1. Gear image 


Roberts 


Prewitt 


Proposed method 


Fig. 2. Edge detection results of gear image 


Fig. 3. Noised gear image 
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Roberts 


Prewitt Proposed method 


Fig. 4. Edge detection results of noised gear image 


6 Conclusions 


A novel method for edge detection is presented. It uses kernel-based fuzzy c-means 
clustering algorithm to combine Roberts, Sobel and Prewitt edge detection operators to 
detect out the edge points adaptively. The resulting edge images show that the per- 
formance of the proposed integration method is superior to the single edge detection 
operator. In the future, we will also explore our previously proposed clustering algo- 
rithm [7-8] to integrate the edge detector operators. 
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Abstract. In this paper, we introduce a new approach, using combined feature 
for fingerprint matching. Our proposed approach is the idea that this feature 
combines the information of every two singular points such as core and delta, of 
the image and takes the ridges structure between them into account. The com- 
bined feature is invariant with respect to the global transformations such as ro- 
tation and transformation. The proposed recognition is performed in three steps: 
singular points extraction, combined features extraction and matching. Experi- 
mental results on FVC2006 show efficiency and accuracy of the proposed 
method. 


Keywords: Image processing, fingerprint matching, singular point extraction, 
core and delta detection. 


1 Introduction 


The fingerprint -a biological feature of humankind- has many particular properties 
such as uniqueness, stableness, and inseparability from the host. It has been used for 
personal verification for more than one hundred years, and is the most widely used 
biological recognition technique today. Among the many current biometric technolo- 
gies, fingerprint matching is the oldest and the most popular method widely used in 
different commercial and security applications. More than 100 features are defined for 
representation of a fingerprint. Singular points such as minutiae, Core and Delta are 
shown in Figure 1. 

Among the features, ridge ending and ridge bifurcation, shown in Figure 1, are the 
most commonly used features which are named minutiae points. Several minutiae- 
based approaches have been proposed to match fingerprints [1], [2], [3], [4]. Some of 
them are based on singular points [5] and some others are independent of singular 
points [1], [2]. There are two major types of features that are used in fingerprint 
matching: local and global features. Local features such as minutiae contain the 
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information in local area and invariant with respect to global transformation such as 
rotation and transformation. On the other hand, global features, such as number, type 
and position of singularities, spatial relationship and geometrical attributes of ridge 
lines, size and shape of the fingertips are characterized by the attributes that capture 
the global spatial relationships of a fingerprint [6]. It has been showed that the geo- 
metric deformation can be more easily controlled than global deformation [7]. In [1], 
[2], a secondary feature has been introduced using relative distance, radial angle and 
minutia orientation. Jiang and Yau [8] have used this relative information along with 
ridge count and minutiae type. 


" Ridge Fingerprint 
' Bifurcation 
a Preprocessing 
Core 
Delta —- 
: Directiona 
8 Fiald 
— Z (A Ridge Estimation | 
Z aA ae Singular Points —>| Combinec |» Matchinc 
Extraction Features 5 
A i 
Singular 
Pointe 


Fig. 1. Singular Points (Core and Del- Fig, 2. The proposed fingerprint matching 
ta) and Minutiae (ridge ending and system. 
ridge bifurcation). 


In this paper, we introduce a combined feature for fingerprint matching. The com- 
bined features are derived from singular points features. We use the minutiae-based 
representation of a fingerprint. The combined features are extracted from image ob- 
tained by preprocessing steps. The feature, introduced in Section 3, is defined for 
every two points, delta and core. Figure 2 shows the proposed matching system. 
Three main steps of our proposed method are: 1) Singular points extraction 2) Com- 
bined feature extraction 3) Matching 

In this paper, it is assumed that a fingerprint with no distortion is available. The 
combined features are extracted from the clear fingerprint. In Section 2, we extract 
singular points. The new combined feature is described in section 3. Matching is de- 
scribed in section 4. Experimental results on FVC2006 are presented in Section 5. The 
paper is concluded in Section 6. 


2 Singular Points 


In this section, we use an accurate method to singular points detection [9]. In [9], a 
method is presented for using directional fields of fingerprints images and orientation 
in extraction of singular points. The main results are repeated here. 
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2.1 Directional Field Estimation 


The directional field describes the coarse structure, or basic shape, of a fingerprint [9]. 
The directional field is defined as the local orientation of the ridge-valley structures. 
This is for instance used for classification of fingerprints. In [10], a method is pre- 
sented for the estimation of a high-resolution directional field. The main results are 


repeated here. The method is based on the gradient vector IG, @y) Gy (x,y ye of 


the grayscale image J (x, y ) , which is defined by: 


61 (x,y) 
Gy (,y) Ox 
=VI(x,y)= 1 
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The directional field is, in principle, perpendicular to the gradients. However, the 
gradients are orientations at pixel scale, while the directional field describes the orien- 
tation of the ridge-valley structures. This requires a much coarser scale in which local 
fluctuations do not have influence. Therefore, the directional field can be derived 
from the gradients by performing some averaging operation on the gradients, involv- 
ing pixels in some neighborhood [11]. 

Gradients cannot simply be averaged in some local neighborhood, since opposite 
gradient vectors will then cancel each other, although they indicate the same ridge- 
valley orientation. A solution to this problem is to double the angles of the gradient 
vectors before averaging. Then, opposite gradient vectors will point in the same direc- 
tion and therefore will reinforce each other, while perpendicular gradients will cancel. 
After averaging, the gradient vectors have to be converted back to their single-angle 
representation. The main ridge-valley orientation is perpendicular to the direction of 
the average gradient vector. This method was proposed by [12] and was adopted in 
some way for the estimation of the directional field of fingerprints by various 
researchers. 

In the version of algorithm used in this paper, not only the angle of the gradients is 
doubled, but also the length of the gradient vectors is squared, as if the gradient vec- 
tors are considered as complex numbers that are squared. This has the effect that 
strong orientations have a higher vote in the average orientation than weaker orienta- 
tions and this approach results in the cleanest expressions. 

Another difference of this method is that we do not estimate the average directional 
field for a number of blocks in the image. Instead, the directional field is estimated for 
each pixel in the image using a Gaussian window W for averaging. Us- 


ingG 25 GF: , G = X67, and G._ =XG,G_.The average gradient direc- 
xx V5 xy ty 


; en i recs pl 
tion 6, with ae O< 57 is given by: 0 = 5 L Gx “64, 2G. ) (2) 
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2.2. Singular Points Extraction 


Singular points are the discontinuities in the directional field. They are clearly visible 
in a fingerprint image, as can be seen from Figure 1, which shows examples of the 
two different types of singular points that are called core and delta. Singular points 
can for instance be used for the registration (alignment) of two fingerprints of the 
same finger. Many different algorithms for singular point extraction are known from 
literature. The method that is used here is based on the Poincare index [13]. It can be 
explained using Figure 3(a). Following a counter-clockwise closed contour in the 
directional field around a core results in a cumulative change of z in the orientation, 
while carrying out this procedure around a delta results in—z. On the other hand, 
when applied to a location that does not contain a singular point, the cumulative ori- 
entation change will be zero. 


(a) Directional field (b) Squared Directional field (c) Gradient (d) Rotation 


Fig. 3. Processing Steps in the Extraction of a Core. 


The method is capable of detecting singular points that are located only a few pix- 
els apart. In the rest of this section, all calculations are made for the case of a core. It 
is left to the reader to adapt them to the case of a delta. First, the squared directional 
field is taken. This eliminates the transition of z which is encountered in the direc- 


; : ; . 1 1 3 
tional field between the orientations 6 = 5 az and @= ae . As a result, the Poincare 


index is doubled. The orientation of the squared directional field is depicted in Figure 
3(b) for the area around the core example. Instead of summing the changes in orienta- 
tion, it is possible to sum the gradients of the squared orientation as well. The gradient 
vector J can be efficiently pre-calculated for the entire image by: 


020(x,y) 
J (x.y) l=. 
x =V20(x,y) = Ox (3) 
J (x,y) 620(x,y) 
y ee 
dy 


In the calculation of the discrete version of this gradient, both components of J should 
be calculated ‘modulo 2z ’, such that they are always between —z andz . This makes 
the transition from 20 =—-z to 26=z continuous or, in other words, the orientation 
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is considered to be cyclic. The gradient vectors of the squared orientation around the 
core are shown in Figure 3(c). The next step is the application of Green’s Theorem, 
which states that instead of calculating a closed line-integral over a vector field, 
the surface integral over the rotation of this vector field can be calculated. This theo- 
rem is applied to the summation of the gradients of the squared orientation over the 
contour: 


oJ 
LJy tJ. = Loot, I ney ae 0s (4) 


Where A is the area and 0A is the contour. Since the directional field is assumed to be 
smooth, A can be taken as small as a square of 1x1 pixel. This results in a very effi- 
cient method for computation of the Poincare index. Application of the proposed 
method will indeed lead to the desired singular point locations. Unlike all other singu- 
lar point extraction methods, a core results in a Poincare index of 2za delta 
in —2z while the index for the rest of the image is exactly equal to 0. This is illustrated 
in Figure 3(d) for the core example and the area around it. 


2.3. Orientation of Singular Points 


The last stage of singular points detection is the estimation of the orientations @ of the 


extracted singular points. The method that is described here makes use of the squared 
gradient vectors in the neighborhood of a singular point, both for the image to be 
analyzed and for the reference singular point. The averaged squared gradients of the 
core, repeated in Figure 4(a), ideally look like the reference model in Figure 4(b). For 
acore at(x,y ) = (0,0), this reference model are given by: 


(y sx ) (5) 


Core Ref = D2 


The model of a core that has rotated over an angle Mis given by a reference model 


with all its components multiplied by e ue Coreg = Coreper 2/® (6) 


This property is used for the estimation of the orientation of the core. The orienta- 
tion of the core, with respect to the reference model, is found by taking the inner 
product of the estimated squared gradient data and the complex conjugated of the 
reference model, depicted in Figure 4(c). Then, it is divided by the number matrix 
elements N, and the angle is taken. 


p= 2 = Core «7 ef VCore, (.¥) (7) 


For a delta, a related reference model can be made. Because of the symmetries in a 
delta, its relative orientation with respect to the reference model is given by one third 
of the angle resulting from Equation 7, which was used for a core. 
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(a) Squared Directional field (b) Reference Core (c) Conjugated (d) Orientation 


Fig. 4. Processing Steps in the Calculation of the Orientation of a Core. 


3 Combined Features 


The combined features are derived from singular points and localized information 
between them. The combined feature is defined for every two singular points in the 
skeletonized image. We use ridge count between them along with relative orientation 
of each middle ridge and the line connects the two singular points. Ridge count is an 
abstract measurement of the distances between any two points in a fingerprint image 
[14]. Let a and b be two points, core and delta in a fingerprint; then the ridge count 
between a and b is the number of ridges intersected by segment ab. Figure 5 illustrates 
the combined feature. For two points, core and delta 


m, (x; 9; 9 type; ) and m ; (xj vi 9; type j ) we contrast a vector fj as follows: 


Ts. 32 rc 
Sf ij HD Vk Dp iy Oi OG OY ] (8) 


Where (x, ,y; ) and (x pe j are location coordinates of m; and m je is the ridge 


ij 
count between m; and m F o is the kth relative orientation of middle ridges and the 


line which connects m; and m ; o, is the clockwise rotation of m,m, to middle ridge. 


The ridge count and the relative orientation is calculated while scan the skeletonized 


image from m, to mj 


(a) (b) 


Fig. 5. (a) The combined feature on singular points, Core and Delta (b) the combined feature on 
two minutiae. 
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Since different impressions of a fingerprint cause a variety of orientation, we con- 
sidered some regions of orientations as follows [15]: 


Xu a 8 
R, =0<56, <— (9), Ry =— 58, <— (10) 
4 4 2 
1 30 3x 
R, =—<0, <— (QI, «| -Rg=— 56, <2 (12) 
2 4 4 


In order to increase the stability of the combined feature a tolerance area is consid- 


ered. R r Ry ,R 3 and R 4 with constant tolerance area are shown in Figure 6. 


We consider a constant interval for tolerance area. 


B 
oe nee Area 


“ 


Fig. 6. The Orientation Regions. 


1 1 1 
S$, :056 <—+y7 q3) =, Sy: —-ySO6,<—+Y (14) 
4 4 2 


a 3a 3a 
S3:—-ysS0,<—+y (15) , Sg:—-7s6, <a (16) 

2 4 4 
A set of combined features, Q, is stored in template database for each fingerprint. For 


: ; F ? ., a(n —I) 
an image with n singular points the template size is : 


2 


y.,r..,0..,..0.. >} 


QO ={4, = sf lm; # mj. sf ij SOX X ys j ip Oj Fi 


n(n-1) 


2 eee Hd eg HN 2a n (17) 


4 Matching 


The larger rc the combined feature has, the more distinctive it is. The combined fea- 
ture with the largest rc is selected as a reference point in input fingerprint image and 
the best matched combined feature with the same rc is selected as a reference point in 
template fingerprint images. The input fingerprint aligns with the template fingerprint 
according to this reference points. The similarity score is a two element 
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vector < Sc,,Scy >that Sc, corresponds to the number of matched combined features 


and Sc,is the sum of Euclidean distance between every two matched combined fea- 


tures in input image and template images. 
Let I be the input fingerprint image, a set of candidate fingerprint template, MF, is 


provided according to S Cy: 


MF ={Q,|Sc,(Q; .1) > @} (18) 


#of matched combined features (Q,I) 
Se, (Q,1) = (19) 


#of total combined features of input image 


n(n-l) 
2 
Se= 2 ki -4,| (20) 
4; (5)=4 ; () 


Where 4(5)is the fifth element of the combined feature corresponds 
to rq; (5) =4 j (5)in formula (16) means that Sc, is calculated for the combined fea- 


tures of the input image and template image with the same rc. 


The template that has the largest value of Sc, is selected as the matched template. 
For two templates with the same Sc, , the second element, Sc, is used to make the final 


decision. The template that has the smallest Sc, is selected as a result. 
5 Experimental Results 


In the experience, y= 2 andly = 0.9have been used (See (14)-(17) and (19)). The 
36 


Normalized Matching Score are shown in Figure 7. The experiments reported in this 
paper have been conducted on the public domain collection of fingerprint images, 
DB3 in FVC2006. It comprises 800 fingerprint images of size 300x300 pixels cap- 
tured at a resolution of 500 dpi, from 100 fingers (eight impressions per finger). We 
conducted a set of experiments on DB3 meant to compare our algorithm with the 
approach proposed in [16], which matches fingerprints based on both the local and 
global structures of minutiae. For convenience we label the method in [16] as II. The 
algorithm II uses the local minutia structures for registration and computes matching 
score based on the similarity level of corresponding local structures. Matching ex- 
periments have been performed on DB3 using algorithms I and II. We have per- 
formed the two algorithms on DB3 and obtained the results expressed in terms of 
ROC curves, as shown in Figure 8. The results demonstrate that our method is more 
effective than algorithmlI. 
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Fig. 7. Normalized Matching Score. Fig. 8. ROC-Curves on DB3 obtained 


with our matching method (solid line) and 
the algorithm VI (dash-dot line). 


6 Conclusion 


In this paper, we proposed a new localized feature based on singular points detection. 
With using these singular points of the fingerprint we apply our approach named 
combined feature. This feature combines the information of two singular points and 
the ridges structure between them. The combined features have the following advan- 
tages: 1) These are generated from singular points, so this method can be adopted for 
existing applications. 2) These are invariant with respect to translation and rotation. 3) 
Since reference point detection includes a search for the largest rc, the alignment of 
two fingerprints is simple and fast. 4) The experimental results show that the pro- 
posed method is efficient and suitable for fingerprint matching. 
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Abstract. The Transmission Control Protocol (TCP) is the most popular trans- 
port layer protocol for the internet. Congestion Control is used to increase the 
congestion window size if there is additional bandwidth on the network, and 
decrease the congestion window size when there is congestion. This paper uses 
a classic TCP which we called Robust TCP with an accurate algorithm of con- 
gestion detection in order to improve the performance of TCP. Our TCP Robust 
only reacts when it receives an ECN (Explicit Congestion Notification) mark. 
The evaluation result shows a good performance in the terms of drop ratio and 
throughput. 


Keywords: Congestion Control, TCP, CN, Implicit Congestion 
Notification. 


1 Introduction 


TCP is a connection-oriented, end-to-end reliable protocol designed to fit into a 
layered hierarchy of protocols which support multi-network applications. Congestion 
events in communication networks leads to packet losses, and it's well known that 
these losses occur in burst. TCP congestion control involves two tasks: 


1. Detect congestion 
2. Limit Transmission rate 


To achieve good performance and obtain a Robust TCP, it is necessary and important 
to control network congestion, by limiting the sending rate and regulating the size of 
congestion window (Cwnd) after the detection of congestion.TCP congestion control 
operates in a closed loop that infers network conditions and reacts accordingly by 
means of losses. A negative return is due to a loss of a segment which can be trans- 
lated by decreasing the flow from the source through a reduction in the size of win- 
dow control. TCP considers loss of a segment as a congestion in the network, the 
detection of this loss can be done in several ways: Timeout, Three Duplicate ACKs 
(Fast retransmit) and by receiving a partial ACK. 
The state is: 


e If Packet Loss or congestion event =>TCP decreases Cwnd. 
e Allis well and no congestion in the network, i.e., TCP increases Cwnd. 


J. Lee (Ed.): Advanced Electrical and Electronics Engineering, LNEE 87, pp. 261-270, 
springerlink.com © Springer-Verlag Berlin Heidelberg 2011 
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At all cases, loss indication should be done with accuracy because it may lead to false 
indications like: Spurious retransmission. 

Spurious timeout occurs when a non lost packet is retransmitted due to a sudden 
RTT (Round Trip Time) increase (hand over, high delay, variability, rerouting . .) 
which implies to an expiration of the retransmission timer set with a previous and thus 
outdated RTT value. This effect is known to be the root cause of spurious retransmis- 
sion. The function of the congestion control is an essential element to the stability of 
the internet. Indeed, TCP congestion control reduces the flow when it detects a loss in 
the network. Therefore, it is important to be accurate in the loss detection to improve 
the performance of TCP. A congestion event (or loss event) corresponds to one or 
several losses (or in the context of ECN: at least one acknowledgment path with an 
ECN-echo) occurring in one TCP window during one current RTT period, it means 
that a congestion event begins when the first loss occurs and finishes one RTT later. In 
this paper, we propose a congestion detection algorithm that is realized independently 
of the TCP code. To improve the TCP by reducing the Cwnd, we aim to illustrate the 
feasibility of the concept by demonstrating that we can both obtain similar perfor- 
mances and also improve the accuracy of the detection outside the TCP stack. We 
implement the Implicit Congestion Notification (ICN) algorithm to better understand 
and investigate the problem of congestion events estimation. This paper is organized as 
follows: section 2 presents related works, section 3 shows the architecture of the con- 
gestion detection, section 4 presents the detailed discussion for the Robust TCP with 
ICN congestion detection algorithm, and section 5 presents an evaluation of the TCP 
Robust using simulations. Finally, section 6 concludes this article and presents some 
perspectives. 


2 Related Works 


Over the past few years, several solutions have been proposed to improve the perfor- 
mance of TCP. In [5] proposed TCP-DCR modifications to TCP's congestion control 
mechanism to make it more robust to non-congestion events, this is implemented by 
using the delay "tau" based on a timer. Our mechanism is different; it relies on the 
accurate congestion detection algorithm (ICN) and uses the timestamp option to 
detect spurious timeout which can more improve the reliability of the algorithm and 
leads to a real Robust TCP. In Forward RTO-Recovery (F-RTO): the F-RTO algo- 
rithm of the TCP sender monitors the incoming acknowledgments to determine 
whether the timeout was spurious. TCP suffers from the inaccuracy of the congestion 
detection in the other TCP agents, for this reason we design an accurate mechanism of 
congestion detection (ICN) that interacts with TCP robust. Our study must prove the 
functionality of our TCP with ICN is better than other versions of TCP. For this point 
we have to show that the mechanism of congestion detection for some TCP variants 
(New-Reno, Sack) doesn’t detect well when there is congestion and doesn't work 
better than TCP Robust with ICN. In [5], the idea or the solution proposed for the 
detection of congestion is the delay of the time to infer congestion by T, and this val- 
ue should be large to recover from non-congestion event, and should be small to avoid 
expensive RTO. Our approach is different by using a classic TCP that responds only 
to an accurate algorithm of congestion detection. 
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3 Stand-Alone TCP Congestion Events Algorithms 


In this section we present the architecture of decorrelating congestion Detection from 
the Transport Layer (figure 3).The main goal of this architecture is to simplify 
the task of kernel developers as well as improve TCP performances. This scheme 
opens the door to another way to react to congestion by enabling ECN emulation at 
end-host. In this case ICN emulates ECN marking to imply a congestion window 
reduction. 


( TcPwindowreacttoECN | 
signaling | 


CWR: Congestion window 


reduced 


TCP packets 


[ ICNzimplictt Congestion ) TCP packets 


Notificat 


Use ACKtos 


Mark ECON to signal CE 


— 


ACK /ECN 


Fig. 3. Decorrelating Congestion Detecting from the Transport Layer 


4 Enhancement of TCP Algorithms 


Our proposed algorithm which we called Robust TCP is to make the congestion de- 
tection reliable and to distinguish the causes of losses in order to improve the flow 
control. The main idea is to determine CE (i.e. the congestion detection) which impact 
on the TCP flow performance by monitoring the TCP flow itself. The principle is to 
obtain a detection system at the edge of a network or at the sender side which analys- 
es the TCP behavior through the observation of both data packets and acknowledg- 
ments paths. So, the scenario is to make a new version of TCP (Robust TCP) without 
detection of congestion. Robust TCP doesn't reacts (reducing of Cwnd) whenever it 
doesn't receive a notification ECN. Robust TCP must interact with ICN algorithm 
through ECN. Once we have congestion indication and the congestion event is vali- 
dated, in this case it must notify the TCP we are exploring the functionality of Robust 
TCP and the ICN algorithm with the interaction between each other. Robust TCP 
maintains all the functions of TCP Reno (slow start, Congestion avoidance, Fast re- 
transmit and Fast recovery) and modified by adding error control and limited transmit 
(like in New-Reno TCP) to avoid unnecessary timeouts. 

Robust TCP is a classic version of TCP but very sensitive to packet loss. It con- 
tains the major congestion control phases: 


1. Slow start 
- When ACK received: cwnd++ which means for every ACK received, 
the sender sends two more segments. 
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- Exponential increase in the window (Every RTT: cwnd = 2*cwnd) 
- Threshold (sstrhesh) controls the change to congestion avoidance. 
2. Congestion avoidance (increase Window size). 

- When ACK received: cwnd+ = 1/cwnd. 

- Linear increment of cwnd (every RTT: cwnd++) slow start is exists until 
cwnd is smaller or equal to ssthresh. Later congestion avoidance takes 
over. 

3. Fast retransmit (Detection of congestion). 

- TCP generates duplicate ACK when out-of-order segments are received. 
In this case Fast retransmit uses "duplicate ACK" to trigger retransmis- 
sion packets, so the sender does not wait until timeout for retransmis- 
sion, sender retransmits the missing packet after receiving 3 DUPACK. 

4. Fast recovery. 

- TCP retransmits the missing packet that was signaled by three duplicate 
ACKs and waits for an acknowledgment of the entire transmit window 
before returning to congestion avoidance. If there is no acknowledg- 
ment, TCP Robust experiences a timeout and enters the Slow-start state. 


TCP recovers much faster from fast retransmit than from timeout. When congestion 
window is small, the sender may not receive enough dupacks to trigger fast retransmit 
and has to wait for timer to expire but under Limited transmit; sender will transmit a 
new segment after receiving | or 2 DUPACKs if allowed by receivers advertised 
window to generate more dupacks. Robust TCP is poor in performance without detec- 
tion of congestion and worse than other TCP like TCP New-Reno and Sack. It reacts 
only on the receiving of ECN notification. Once it doesn't receive a notification that 
means there is no congestion control on TCP and the window keep increasing, but in 
case of receiving ECN that will indicate the occurrence of congestion indication noti- 
fied by ICN, than Robust TCP reacts by limiting its sending rate and takes the full 
meaning of its name. 


4.1 ICN with Timestamp 


ICN (implicit congestion notification) is an algorithm for congestion detection im- 
plemented outside the TCP stack to analyze TCP flow and to better understand the 
problem of congestion events and then to conclude if the congestion occurs in the 
network or no and it is also more accurate in congestion detection than TCP. The 
main goal of ICN is to determine the losses (i.e. the congestion detection) which im- 
pact on the TCP flow performance by observing the flow itself which mean by look- 
ing at the losses occurring over an RTT period given. ICN is a generic algorithm that 
doesn't depend on the TCP version used which implements a congestion control 
where a negative feedback means a loss. It is important to note that ICN doesn't man- 
age the error control which remains under the responsibility of TCP. Starting from the 
observation of the data segments and the acknowledgments, we identify each TCP 
connection with a state machine. This state machine indentifies the control congestion 
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phase and classifies retransmission as spurious or not.TCP congestion control reacts 
following binary notification feedbacks allowing assessing whether the network is 
congested or not. ICN algorithm consists of two states: 


1. Normal state: which characterizes TCP connection without losses, in this state no 
congestion occurs and the sender receive the ACK normally. 

2. Congestion state: This state starts from the loss of the first window data segment. 
When a loss occurs ICN enters in this state and waits to the congestion event to be 
validated to notify Robust TCP about this loss. When the top of the window is ac- 
knowledged, ICN enters in the normal state. 


To improve the performance of the congestion detection algorithm and especially 
against spurious timeout we added the timestamp option, in order once the congestion 
happens ICN enter in this state and append a timestamp to let the sender to compute 
the RTT estimate based on returned timestamp in ACK. Time stamps used in this 
state to measure the round trip time (RTT) of a given TCP segment and including 
retransmitted segment, this option also can help to eliminate the retransmission ambi- 
guity ( due to false indication) and identifies when retransmission is spurious or not. 
Spurious Timeout are inevitable and not rare in data networks, for this reason and 
once the congestion event occurs, ICN enter in the congestion event state, timestamp 
is added for each data segment. Timestamp can be considered as an acknowledging 
mechanism in the time domain. In the figure (4) shown below we will present the 
flowchart of TCP Robust with ICN mechanism: 


Néiaal Congestion Event 
Packet i 
Recetved 
Successfully Swit 
‘tia No Nomnal 
congestion State 


Yes 


Congestion 


ECN 


¥ 
Robust Tcp Agent 


L_____s Slow-down 


Fig. 4. Robust TCP with ICN detection algorithm 
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4.2 Robust TCP and ICN interaction 


ICN is an accurate congestion detection algorithm where after detecting a loss event 
in the congestion state, the congestion event (CE) must be validated. The validation of 
CE should lead to a congestion indication which is the principle responsible to inform 
the Robust TCP about the congestion. The confirmation method due to a congestion 
indication is ECN (Explicit congestion notification), which is the main fag in the 
ACK to notify the loss to the source TCP. Once the source is signaled by ECN notifi- 
cation it reacts by reducing its window (Cwnd) and this time Robust TCP takes the 
full meaning of its name. After reducing its window, we can notice very well the 
decreasing of the number of dropped packets (d) in the network due to using of ICN 
congestion detector and our TCP becomes better in performance than others like TCP 
New-Reno and Sack. 


5 Validations and Evaluations 


In this section we evaluate the performance of Robust TCP with ICN algorithm. 
The main idea is to build an algorithm of congestion detection outside the TCP stack 
that is responsible to detect the loss and notify it to Robust TCP. The architecture of 
our tools is shown in the figure (3), which is mainly composed from the following 
components: 


1. Network topology. 
2. Traffic model. 
3. Performance evaluation metrics. 


After the simulation is done, a set of result statistics and graphs are generated. 


Network — | | Traffic Model Performance | 
Topology Metrics 


* * x 
Results in statistics and Graphs 


Fig. 5. Architecture of our tools 


5.1 Network Topology 


To study our TCP and ICN behavior we built our Network and application model 
shown in figure 6, in which source nodes and sink nodes connect to router | or router 
2. The bandwidth between the two routers is much lower than the other links, which 
causes the link between the routers to be a bottleneck. (Traffic can be either uni- 
directional or bidirectional). 
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Fig. 6. Network topology 


5.2 Traffic Model 


The tool attempts to apply the typical traffic settings. In our application include the 
FTP traffic that uses infinite, non-stop file transmission, which begins at a random 
time and runs on the top of TCP. Implementation details and a comparative analysis 
of TCP Tahoe, Reno, New-Reno, SACK and Vegas choices of TCP variant are de- 
cided by users. 


5.3 Performance Evaluations Metrics 


The metrics used in our simulations are Throughput and Drop ratio. Throughput is the 
total elapsed flow since the beginning of simulation time. Throughput may also in- 
cludes retransmitted traffic (repeated packets).Drop ratio is the total rate of packet 
loss during the simulation time. To obtain network statistics, we measure also the 
drop ratio metric that result in the failure of the receiver to decode the packet and 
simulation time is 100 seconds. Robust TCP is poor in performance as a standalone 
TCP but after adding the ICN it becomes much better (see figure 7) and accurate than 
TCP New-Reno as show in the figure (8). To evaluate our scenario, we compare TCP 
Robust with other TCP variants (TCP New-Reno) by using different metrics that will 
show us clearly the improvement of our TCP version compared to others. (Figure 8 
and 9). 


Robust TCP with ICN — 
Robust TCP without ICN 


Dropratio 


o ® 100 


Time 


Fig. 7. Comparison between TCP Robust before and after adding ICN algorithm. 


The main difference between Robust and New-Reno TCP occurs in the reaction of 
each protocol. In the TCP New-Reno the reaction will be whenever an error or con- 
gestion occurs on the network by slowdown the transmission without being accurate 
if there is a congestion or not. In addition of that the main problem of New-Reno TCP 
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that it suffers from the fact that it takes one RTT to detect each packet loss. When the 
ACK for the first retransmitted segment is received only then we can deduce which 
other segment was lost. This problem of inaccuracy in TCP New-Reno is solved by 
the ICN algorithm that the ICN receive the packet and check the presence of conges- 
tion by using the normal and congestion phase and by adding the timestamp option 
which can be make sure of the presence of congestion or no. The deduction of con- 
gestion in TCP Robust is different from New-Reno, it will be deduced after signaling 
ECN from ICN to TCP robust, and then the TCP reacts by decreasing the transmis- 
sion. This accuracy in detection of congestion can be up to 24 % as difference be- 
tween the two protocols (Figure 8) before reaction of each one and starting slowdown 
retransmission. Due the fast reaction of TCP robust, the transmission of TCP become 
less than in TCP New-Reno which means that the throughput in the TCP robust must 
be less than in New-Reno, this is clear and deduced in the figure 8. 


Robust TCP = ———— 
New-reno TCP 


Dropratio 


Time 
Fig. 8. Comparison between TCP Robust and TCP New-Reno 


In figure (9) represents that the drop ratio is less in Robust than in New-Reno due 
that TCP reacts only when receiving ECN which make its reaction faster. 
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Fig. 9. Comparison between TCP Robust and TCP New-Reno 
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In figure (9) Robust TCP algorithm reaction is faster than the Reaction of New- 
Reno, thus Throughput in New-Reno is higher than when using Robust TCP. Conges- 
tion detection used by ICN algorithm is more accurate when using the timestamp 
option for detecting a spurious timeout which improve more the performance of TCP. 

The main difference between spurious timeout algorithms relies on the method 
how to detect spurious timeout by solving the retransmission ambiguity in many cir- 
cumstances. After clarifying this ambiguity TCP can tell whether the data is there is 
spurious timeout has happened or not. DSACK, F-RTO and Robust TCP can see the 
problem of spurious timeout in different aspects. DSACK, an extension of TCP 
SACK, works it out in the sequence space. It requires the TCP receiver explicitly 
acknowledging duplicate segments with duplicate SACK options. F-RTO algorithm is 
used for detecting spurious retransmission timeouts with TCP. It is a TCP sender-only 
algorithm that does not require any TCP options to operate-RTO delays the decision 
of loss recovery and waits further two ACK. If the first arrived ACK forwards the 
sender's transmitting window, TCP concludes a spurious timeout and resume trans- 
mitting new data. Our approach is different than other TCP by using an algorithm of 
congestion detection outside the TCP code, where it can detect congestion and spu- 
rious timeout by using the timestamp option at the occurrence of loss or congestion 
event. The main advantage of ICN with timestamp algorithm is that it can work with 
spurious timeouts and the others loss events by detecting the congestion in the net- 
work immediately and then directly will be notified to Robust TCP in order that TCP 
after this action will reduce its window, which can improve very well the performance 
of our TCP. 


6 Conclusion and Perspective 


This paper has proposed a new algorithm, which is implemented as a standalone 
component and not inside a TCP stack. This algorithm that interacts with a classic 
version of TCP is able to detect congestion and notify directly the loss to the Robust 
TCP through the congestion notification (ECN) in order to reduce its window which 
leads to a Robust TCP compared to other variants like New-Reno and SACK TCP. In 
our work we demonstrate that congestion event detection can be realized independent- 
ly of the TCP code in sake of better detecting congestion occurring in the network. 
Following this work and the results obtained so far, we are currently planning to de- 
velop more the detection of congestion by using the delay-based in the congestion 
detection algorithm (ICN) and the effect of fast reaction of TCP robust in the Net- 
work. 
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Abstract. In view of some traditional defects, say, incomplete codes, and im- 
perfect criterions on critical values, of IEC 3-ratio-code law in transformer 
faults diagnosis, a novel transformer faults diagnosis method is proposed based 
on adaptive neuro-fuzzy inference system (ANFIS) in the paper. The ridge-type 
distribution functions serve as the fuzzy membership functions in input layer of 
ANFIS, and Fletcher-Reeves (FR) conjugation gradient algorithm with good 
global convergence properties acts as the learning algorithm of ANFIS, the 
learning quality of ANIFS is therefore improved, dramatically. Eventually, the 
test results in transformer faults diagnosis show the validity of the method. 


Keywords: Fletcher-Reeves conjugate gradient method, ridge-type distribution 
function, ANFIS, transformer, fault diagnosis. 


1 Introduction 


Transformers are key pieces in power supply and distribution systems, whose normal 
operation is a basic guarantee for power supply reliability. During the occurrence of 
the faults the insulating material of the transformers are degraded, resulting in the 
generation of gases. The type, the amount and the proportion of these gases depend on 
the type of degraded material, of the responsible phenomenon for the degradation and 
the levels of energy involved in the action. And so, it is possible to characterize the 
type and the severity of the transformer faults through the analysis of the gases com- 
position that find dissolved in the insulating oil. Presently, the broadly applied criteria 
for the fault diagnosis in transformers starting from the analysis of the dissolved gases 
analysis (DGA) in the oil is IEC 3-ratio-code diagnosis law[1]. Later, the law is fur- 
ther improved to suit the code-combination under condition that several faults are 
coexistent, which is named as improved IEC 3-ratio-code law[2]. However, the me- 
thod is inadaptable to the condition that the volume capacities of the gases lower than 
the threshold value. In addition, IEC codes are incomplete and the criteria for critical 
values are imperfect[3]. In present, the methods to solve the problems mostly include 
fuzzy synthesis judgment[4], evolutionary fuzzy logic[5], fuzzy neural network[6], 
neuro-fuzzy hybrid system[7], Kohonen networks[8], adaptive neuro-fuzzy inference 
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system (ANFIS)[9], and et al. In [4], fuzzy logic is applied to solve the “bottle neck” 
difficulty in fuzzy boundary knowledge acquisition. It effectively improves the dis- 
posing abilities of fuzzy information, but fuzzy membership functions and the shape 
parameters are subjectively decided by man beforehand. In [5] the traditional TEC 
3-ratio-code criteria are applied to establish the initial architecture of the diagnostic 
system, including the diagnostic rules and membership functions of the fuzzy subsets. 
After this first step, a genetic algorithm is applied to simultaneously adjust the diag- 
nostic rules and the shape parameters of fuzzy membership function in order to obtain 
the best performance for the group of samples given. However, the process is no self- 
adaptive. When the distribution of the samples space is complicated and comprises 
noise, it is difficult to acquire the fuzzy rules. In [6] fuzzy neural network is applied to 
implement fault diagnosis so as to improve the learning ability of fuzzy information, 
but the fuzzy membership functions, and the shape parameters related to it, are still 
specified by man, subjectively. In [7] a neuro-fuzzy hybrid system that combines a 
ANN with a fuzzy evolutionary expert system, with a purpose of allying the advan- 
tages of high learning and high capacity of non linear map of ANN’s with the explicit 
knowledge represented by the rules of a fuzzy expert system is presented to improve 
the knowledge denotation and inference learning abilities of fuzzy expert system. In 
[8] a Kohonen neural network approach is presented. The application of the technique 
is due to the high capacity patterns classification and low computational cost. In [9] 
an adaptive neuro-fuzzy inference system(ANFIS), that not only incorporates with the 
advantages of fuzzy logic and ANN but also overcomes the flaws of the two, is pre- 
sented to improve the knowledge processing ability and adaptability. But the distribu- 
tion of the samples space is complicated and comprises noise, the learning of ANFIS 
is a difficult task, i.e., it is unsuitable to ANFIS learning. Due to that many improved 
algorithm is proposed. But these methods mostly implement revision and selection for 
input-output data, and don’t resolve the problem, radically. Hence, to improve the 
diagnostic efficiency and intelligence of IEC 3-ratio-code further, and mine faults 
information sufficiently, in this paper, the fuzzy membership functions and algorithm 
of ANFIS are improved so as to construct a high-efficient expert system for transfor- 
mer faults diagnosis, it boosts up the robustness and adaptability of the diagnosis 
system, dramatically. 


2 ANFIS 


2.1 ANFIS Architecture 


ANFIS is an integration of neural network and fuzzy inference system(FIS) as in [10]. 
Sugeno-type FIS is comparatively simple and makes for mathematics analysis, it is 
for that adopted in ANFIS model. ANFIS provides a modeling method based on data, 
and easily incorporates with other optimizing self-adaptive methods to form a fuzzy 
modeling tool with optimizing and self-adaptive ability. Fig.1 shows the ANFIS ar- 
chitecture for transformer fault diagnosis based on IEC 3-ratio-code. 
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Fig. 1. ANFIS structure 


The architecture of ANFIS may be divided as 6 layers. In Fig.1 the Ist layer means 
input layer where X={R\=C,H2/C2Hy, Ro=CH4/H2, R3=C2H2/C2H¢}.The 2nd layer is 
fuzzy segmentation layer where there are 9 nodes, and each node applies ridge-type 
distribution function, whose shape parameters may be adjusted automatically through 
the learning of the networks. The third layer is rule layer where there are 27 nodes 
fully covering universe space, each neuron in this layer is corresponding to single 
Sugeno-type fuzzy rule. The rule neuron accepts its inputs from fuzzy segmentation 
neurons and computes the rule strength expressed by it. The fourth layer is norma- 
lized layer where each neuron accepts all inputs coming from rule layer, and works 
out active strength for specified rule. The fifth layer is reverse fuzzy segmentation 
layer where each neuron connects itself to each normalized neuron, and simultaneous- 
ly, accepts the initial inputs R,, R2, and R3. The sixth layer is a summation neuron 
where all the outputs in reverse fuzzy segmentation layer are incorporated as the final 
output y expressed by 


Ba y: R® a YH ko +k, R +k Ry +k R5]- (1) 


i=l i=l 


where kj, ki, ki2, ki3 is the linear parameters of Takagi-Sugeno-Kang(TSK) equation. 
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2.2 ANFIS Learning Algorithm 


Traditional ANFIS adopts the blend learning algorithm of BP and Least Mean Square 
(LMS), where BP algorithm is applied to decide the shape parameters of fuzzy func- 
tion and LMS is used to confirm the linear parameters of TSK equations. The algo- 
rithm of ANFIS comprises two processes in each iterative cycle, that is, forward 
propagation of information and backward propagation of error. In forward propaga- 
tion, the training set of input patterns presents in ANFIS, the outputs of the neurons 
are calculated layer by layer, where the parameters of the latter item of the rule is 
represented using LMS. In Sugeno-type FIS, the output y is expressed by the linear 
function. Hence, give out the shape parameters of fuzzy membership functions and P 
training sets of the input-output patterns, then P equations are formed below. 


yO=LAOFO+H050+--+4,0f,0 
yy(2)= (2) f,(D+ G2) f2Q)+-+ 4, (2F,(2) 


yy (P)=K (PL (P+ (Df (p) +--+ Ly (P)F,(P) (2) 


yy (P) = L(P)f,(P)+ Ly (P) fy(P) +++, (P) f,(P) 
where 


FQ =Kyy + KX (@ +k, .X,(a) +++ +k, x, (a) - (3) 


am” mm 


In the above equations, m is the number of the neurons in input layer, and n is the 
number of the neurons in rule layer. Assume that the inputs are represented by 
X\(P),°**, Xm(p), and y,(p) serves as the desired output of ANFIS, we then have 


yy = Ak- (4) 


where y, is Px1-dimensional desired output vector expressed by 


ya=a1), yA2), «-- YAP). (5) 


and A is Pxn(1+m)-dimensional matrix expressed by 


(1) (Dx) yx, () 4,0) 4, )x,0) 4, x, (0) 
Hy(2) fy (2)% (2) + Hy (2)% (2) 2+ Hy (2) Hy (2) (2) +, (2% (2) 


HP) (PIX (P) = (PX (DP) (PY Hy P)%(P) i, (P)%q(P) (6) 


LCP) w(P)x,(P) + Uy (P)x,(P) + M,(P) ,(P)x(P) -H, (P)x,(P) | 


and k is the n(1+m) x1-dimensional vector expressed by 


T 
k= [Kio ki, ki a Kin Ky ky, ky i Kom Kno Kn Kio Se kin | (7) 


Transformer Fault Diagnosis Method Based on Improved ANFIS 275 


Usually, the number of the input-input pattern P is larger than n(1+m), and the exact 
solution of equation (4) is possibly inexistent. However, we may find the LMS esti- 
mation k of k that can make || Ak—yd || * least. This can be realized using the follow- 
ing fake-reversion technique. 


k’ =(A’A)'A’y,. (8) 


During learning of the network, the calculating error may be adjusted according to the 
practical outputs and the desired outputs of the system. The mean square error (MSE) 
of the network is expressed by 


1 2 
os reed, ) 
where y,; and y;are the desired value and the output value of the j" node in output 
layer. During learning, centre d,, width oj, weigh w; are adjusted by 
oH 
d,(n+1)=d,(n)—9 = d,(n)—12- (10) 


] 


oH 
0, (n+1) = OO = 0, (n)—nAw f,(w; ~ Vi LX _ d,(n)] / 0; K (11) 
ij 


w, (+1) = wy (1) — Mls Mp, Vy — Y/Y; (12) 
J 


where n is the iterative times, 1 is learning factor, 0<y<1, and 


A=Uyy-¥)/ DWM Me, (21%, - 4 )]/ 0; (0)}- (13) 


2.3 Innovation of ANFIS Learning Algorithm 


In the traditional ANFIS, the adjustment of network parameters are implemented 
through BP learning algorithm or the combination of BP and LMS algorithms. When 
applying BP algorithm, the shape and linear parameters of ANFIS are adjusted at the 
same time. When applying combination algorithm, the shape parameters are adjusted 
through BP algorithm and the linear parameters are adjusted through LMS algorithm. 
Clearly, the two apply BP algorithm. However, BP algorithm has some defects, say, 
low training speed and easily falling into local extremum. Due to that many improved 
methods are presented. They can be generally classified as the two types. One is to 
apply the heuristic technique through analysis on performance function of BP algo- 
rithm, for instance, additive momentum law, self-adaptive learning rate law, and resi- 
lient back-propagation (RBP), and et al. The other is to speed BP algorithm through 
standard numerical value optimization, such as conjugation gradient method, quasi 
Newton method, and Levenberg-Marquart(LM) law and so on. Through a number of 
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analysis and comparison on diverse algorithms in terms of structure, complicity, and 
training accuracy, Fletcher-Reeves(FR) conjugation gradient method is considered to 
be conveniently realized and achieve better performance[11], hence it can be adopted 
in this paper. Standard BP algorithm adjusts the weights along the most rapid direc- 
tion of loss function descending, that is, negative gradient direction. Its single step 
algorithm is expressed as follows. 
Waa = W, an a,G,, = W, —a,Vf (W,) ° (14) 
where W,, is the current weigh vector, Vf(W,,) is the current gradient, and a, is the 
learning rate. 

In conjugation gradient method, the first step of the algorithm is explored along 
negative gradient direction too, i.e., Do=Go=-Vf(W,,), and then, linearly scout is per- 
formed to ensure an optimal motion distance in the current searching direction. 


D,=G,+B,D,.- (15) 


where D,, is the scout direction of conjugation gradient algorithm. In diverse algo- 

rithms the calculation of ,,is different. In FR algorithm, 7, is computed by 
GG, 

G!.G 


n-1 


B,= (16) 


During searching, the direction still required to be corrected, and the algorithm will 
not be ensured to be convergent otherwise. The corrected method is described as 
follows. When the training time is integral times the weigh number, let £,=0. At the 
same time, in every step of the training, if V/(W,, Y'Dn >0, then D,= -Vf(W,,). This en- 
sures that the searching is always implemented along error descending. 


3 Transformer Fault Diagnosis Model Based on ANFIS 


In fuzzy region partition of IEC 3-ratio-code, the boundary points 0.1, 1, and 3 are 
educed based on a lot of trials and analysis on transformer fault examples. Clearly, 
they are statistic and of decentralization. The statistic method inevitably abnegates 
some minor factors, and the precision may be sacrificed. However, fuzzy membership 
function may describe the decentralization phenomenon. When fuzzy mathematics is 
applied to deal with code problems, the key is to construct the fuzzy membership 
functions of the codes. According to knowledge and experience of expert system(ES), 
descending ridge-type distribution function is selected to describe the fuzziness of 
code regions. Thus the fuzzy membership functions of the inputs R\=C,H,/C Hy, 
Rj=CH4/Hb, and R3=C2H>/C2H¢ on code 0, 1, and 2 can be constructed beolw. 

1) Fuzzy membership function of R; 


1 x<0,08 
u(x) =40.5—O5sin[25z(x—0.))] 0.08<x<0.12 ° (17) 
0 x>0.12 
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0 
0.5+0.5sin[252(x-0.1)] 
u,(x)=41 
0.5 —0.5sin[52(x-3)] 
0 
0 
u(x) =40.5+0.5sin[52(x—3)| 
1 


2) Fuzzy membership function of R, 


0) 
0.5+ 0.5 sin[2577(x —0.1)] 
Vy (x) =41 
0.5-—0.5 sin[5z(x—-1.0)] 
0 
1 
v,(x) =40.5—-0.5 sin[252(x—- 0.1] 
@) 
0 
Vv, (x) =40.5 + 0.5 sin[5z(x —1.0)] 
1 


3) Fuzzy membership function of R; 


1 
Wy (x) = 4 0.5—0.5sin[577(x —1.0)] 
0 
0 
0.54 0.5sin[S5z(x—-1.0)] 
w(x) =41 


0.5 — 0.5 sin[5S7(x — 3.0)] 
0 


x < 0.08 

0.08<x<0.12 
0.12<x<2.9 
2.9<x<3.1 


x >3.1 


x<29 
2.9<x<3.1 
x>3.1 


x < 0.08 

0.08 < x < 0.12 
0.12<x<0.9 
0.9<x<l1.l 
x>t1.l 


x $0.08 
0.08<x<0.12- 
x>0.12 


x<0.9 
09<x<l.l- 
x>tI.l 


x <0.9 
0.9<x<l1.l- 
x>tLl 


x<0.9 
0.9<x<1.1 
L1l<x<2.9 ° 
2.9<x<3.1 

x>3.1 


277 


(18) 


(19) 


(20) 


(21) 
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(24) 
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0 x<2.9 
w,(x) =410.5 + 0.5sin[52(x—3.0)] Megesi« --) 
1 x>3.1 


4 Example 


270 samples are randomly drawn out from the DGA diagnostic knowledge base of the 
no.1 main transformer in one substation to train ANFIS. These samples are averagely 
classified as 9 groups corresponding to 9 fault-types of IEC code table. Below we take 
the fifth fault in IEC code table, i.e., low temperature overheating, as an example to 
illustrate modeling procedure of ANFIS. Let 30 samples present low temperature 
overheating fault, and the corresponding outputs be 1, the rest 240 samples be other 
type’s fault-type and the outputs be 0. Finally, some data are randomly drawn out 
from the rest data in data base as checking data to perform model affirmation. Fig.2 
shows the change condition of the step length during training process. 
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Fig. 2. Curve of training step-length change 


Fig. 3 shows the error graph during learning. 

Seen from Fig.3, applying the descending ridge-type membership function with its 
parameters given, the system may rapidly converge, the computing cost is, due to that, 
saved to very extent. 

To test the validity of the model, some data are randomly selected to serve as 
checking data that they don’t participate in training. Fig.4 is the fitting results of the 
checking data of ANFIS. 
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Fig. 4. Fitting test data 


Below a comparison is made between the improved ANFIS and traditional ANFIS 
learning algorithm. 


Table 1. Transformer faults indentification results. 


Num. of samples tested % Accurate 
Input data Correct Improved Traditional Improved Traditional 
diagnosis ANFIS ANFIS ANFIS ANFIS 
Fault-type 0 sample 30 28 19 93 63 
Fault-type 1 sample 30 27 16 90 53 
Fault-type 2 sample 30 28 20 93 66 
Fault-type 3 sample 30 27 23 90 16 
Fault-type 4 sample 30 29 22 96 73 
Fault-type =) sample 30 28 22 93 73 
Fault-type 6 sample 30 27 20 90 66 
Fault-type 7 sample 30 27 23 90 16 


Fault-type 8 sample 30 28 21 93 70 


280 H. Su 


Seen from Table1, compared with traditional ANFIS with BP algorithm based, the 
improved ANFIS algorithm shows higher accuracy and excellent property. 


5 Conclusions 


ANFIS can automatically acquire fuzzy rules through training samples and overcome 
the defects of fuzzy inference system(FS) that excessively relies on knowledge and 
experience of experts. ANFIS applies FR conjugation gradient algorithm to effective- 
ly overcome the flaws of BP algorithm and speed the convergent speed. The spot 
applications prove that mentioned ANFIS in this paper applied in transformer faults 
diagnosis is feasible. ANFIS is a self-adaptive system and can rapid track input-output 
data samples to adjust own parameters, it can be so used to investigate on-line faults 
diagnosis technique for the transformers, further. 
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Abstract. In this paper, a large number of studies which are about the typical 
residential basements have been done in Haidian District, Beijing City, China. 
Under the national regulations and the power of relevant authorities, we still 
regard the security risks of an electric circuit fire, in the first place, as a result of 
wrongly installment of electrical circuits. In this article, these risks have been 
carefully analyzed as well as the corresponding solutions being proposed. At 
the same time, we hope that an increasing attention will be paid to the 
residential basement and those users. 


Keywords: Beijing City, residential basements, electrical circuit, circuits fire, 
electric design. 


1 Introduction 


In Beijing, the basement as a rental lease, the residence of the phenomenon is 
widespread. Some of the basements are constructed for residence and they are used 
directly as a part of the building, the other residential basements are lately converted 
from underground storerooms or underground parking lots. The basements provide 
domiciles for low-income workers in catering, logistics, cleaning and other fields in 
Beijing, nonlocal students and job seekers living in Beijing for a short time. 


2 Current Situation 


Because laws and regulations are not sound and the management system is not 
smooth[1], living in the residential basement deals with a variety of security risks, 
such as entrances, corridors and other public spaces blocked with all kinds of debris 
which leads to no access to disperse the crowd when an emergency occurs, and no 
access control systems to avoid the theft events[2]. After conducting researches of the 
rental residential basements in Haidian District, Beijing City, it is obvious that the 
design and construction problems of electrical circuit contribute most to the common 
electric fire. 
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2.1 Inhabitants 


Those people who live in basements are the city’s low-income workers, as a 
consequence, most of them are lack of necessary knowledge and have little command 
of living safety, especially the use of electrical equipments. 


2.2. Living Environment 


The residential basements can be divided into two categories, semi-basements or 
those entirely built underground. Furthermore, the basements constructed fully under 
the ground also consist of two kinds, negative one or two. They have no chance to 
acquire the sunshine. Due to the features of sunshine and air flowing, the lower of a 
basement level, the darker and more humid the house will be, so the less rental is 
charged. The low-incomers’ choosing living in basements results from the valid fact 
that they are comparable cheap to rent in big cities like Beijing. According to the 
investigation of the residential basements near the Fourth Ring Road in Haidian 
District, the monthly rental for each semi-basement room (10 square meters 
residential area on average) varies from RMB500 to 1200, those entirely beneath the 
surface are charged from RMB200 to 500 per month (10 square meters residential 
area on average), to the lowest, the monthly rental of a negative two basement is 
RMB150. 

Because of the low price, characters of the residences and the poor knowledge of 
the tenements in safe basement living which have already drawn the public attention, 
cause the cheat on labor and materials and short of maintaining so as to get both the 
living quality and residential environment into stuck initially. 

The weather in Beijing is hot and rainy in summer, cold and dry in winter with a 
short period of spring and autumn. Take the statistics in 2007 for example, it is an 
average of 14 centigrade annually, negative 7 to 4 in January and 25 to 26 centigrade 
in July. The extreme low record is negative 27.4 centigrade as well as the highest is 
beyond 42 centigrade. 180 to 200 days are without frost all year round with a shorter 
period of western mountain regions. In addition, the average amount of raining is 
483.9mm, whose rainfall is one of the most ones in northern China.[3]As a result, 
humidity is one of the regional weather characters. Worse still, humid environments 
do harm to the safe use of electrical circuits. Besides, those basements with common 
shared toilets, laundries and bathrooms lead to a large wet environment, which calls 
for our consideration in electrical design and construction, as well as the living safety 
of residences. 


3 Formation of Electrical Circuit Fire of the Residential Basement 


Electrical circuits consist of lines and cables, and electrical lines are long pieces with 
lots of branches. They have many opportunities to contact with combustible material 
and wire cable insulation materials are mostly combustible organic materials. When 
the cable or local overheating or the insulation reaches a certain temperature, it will 
burn and cause fire accidents result from the rapid spread of flames, but also produce 
large amounts of toxic gases such as hydrogen chloride and carbon monoxide which 
will cause greater harm to the human body. [4] 


Causes and Solutions of Electrical Circuit Fire of the Residential Basements 283 


In Western Europe, the electricity consumption per capita is tens of times greater 
than the one in China. But in Western Europe, electrical fire only accounts for a few 
percent of the total fire, which is in sharp contrast when compared with that of China 
which is reaching more than 50%. [5] 

Also, according to the fire statistics of the whole nation in the first half of 2009 
provided by the Fire Department Ministry of Public Security, among the direct causes 
of fire accidents, the wrong electrical installation caused fire accidents up to a total of 
19,852 accounting for 26.8% of the total. Second, careless use of fire caused 16,119 
fire accidents making up 21.8%. In addition to the two main reasons, the fire playing 
caused 8,138 fires accounting for 11%, 5,727 fires caused by smoking accounting for 
7.7%, advertence of operation caused 3,171 fires accounting for 4.3%. Also, 
spontaneous combustion leaded 1,418 fires accounting for 1.9% and lightning, static 
electricity and other causes caused 12,385 fires accounting for 16.7%. Meanwhile, the 
fires whose causes are unknown and under investigation are 7,134, accounting for 
9.6%. [6] 

After in-depth analysis of some typical residential basements and its tenants, we 
found that the main causes of the electrical fire are human factors. For example: on- 
site Operator is not in strict accordance with the rules of construction, some staffs are 
even not equipped with the appropriate qualifications; the regulators pay far less 
attention than then should and electrical facilities are lack of necessary inspection or 
maintenance management; the poor electrical knowledge and safety awareness lead 
the tenants randomly change the protecting devices to meet the requirements of some 
electrical facilities, causing a large burden of electrical wiring which makes the 
devices operate irregularly. 


4 Causes of Electrical Circuit Fire of the Residential Basement 


Constraints such as cost control, when designing or constructing the residential 
basement, the electrical circuits are often one-sidedly for the sake of conserving 
materials and other resources, especially for the basements for living purpose adapted 
from warehouses or parking lots. Since the absence of heaters in such basements, 
most tenants will use high-power heating equipments in winter, which will result in 
frequent tripping of electrical circuits. More seriously, the long-term overload of these 
circuits and the electrical insulation will lead to electrical accidents. According to 
statistics, China's electric fire holds the first place because of the following reasons. 


[7] 
4.1 Low Standard of Electrical Circuit Design and Small Circuit Capacity 


When designing the electrical circuits, mode selection is usually relevant to the lower 
limit of the circuit capacity, without a great foresight. The new national standard, 
"Residential Design Code" (GB50096-1999) put into effect in 1999 has published 
some requirements as follows. Electrical lines should be consistent with fire- 
resistance requirements of laying wire. Copper wire should be used, and each line into 
the family home should not be less than 10mm” and the branch circuit cross-section 
should not be less than 2.5mm’. The power outlet sockets of kitchens and bathrooms 


284 Z. Xu, Y. Lu, and N. Han 


should be an independent circuit. Every housing unit should be equipped with a 
power circuit breaker, and should not touch the phase line and the neutral line 
switching equipment at the same time. [8] But it must be noted that these standards 
are still the lowest ones in residential electrical design requirements. 

Due to the specialties of the development and construction of the residential 
basements, their electrical circuits are not only much different from the residential 
units above ground, but also not in the same with those of public institutions. Such 
basements do not have full-time professional electrical maintenance personnel, and 
tenants do not understand electrical knowledge and have no ability to maintain 
electrical safety. Therefore, when some problems occur, fire accidents are prone to 
merge. Coupled with the design requires electrical safety, functionality and adaption 
to the development, electrical circuit design of residential basements needs more 
long-term plans. 


4.2 Negligence of Installation and Maintenance of Electrical Circuits 


When laying the electrical lines with poor planning or in order to save lines, some 
wire connectors are in the tubes, and some wires go out of the protective casing 1m to 
2m then expose to other objects, and simple wire does not twist as insulated. These 
situations will sooner or later cause a fire disaster. Insulated wires are laid directly 
using ordinary non-flame-retardant plastic casings. As soon as there is a spark, the fire 
may easily spread quickly and then becomes a fire disaster. [9] 

In the researching process, particularly in toilets and bathrooms and other damp 
places found that there is no insulation for the insulated to handle the situation. What 
is worse, when part of the electrical power system is running, the connector is directly 
exposed which is not only make it easier to pull the private wire, but also prone to 
electric shocks threaten people’s safety or electric fire caused by a short circuit. 


4.3 Poor Operating Condition of Electrical Circuits 


The basement is very damp, combined with insufficient ventilation set reasonably. 
Electric leakages are very likely to occur. And because of lacking in knowledge of 
electric fire, tenants often accumulate flammable debris such as useless boxes and 
waste newspaper in the channel and other public spaces. On one hand, it makes it 
easier for the fire disaster to take place when an electric leakage occurs. On the other 
hand, due to the accumulation of such debris blocked the channels making it difficult 
to escape when a fire emergency comes into being, it is not hard to lead a more 
serious personal and property losses. 


5 Solutions of Electrical Circuit Fire of the Residential Basement 


5.1 Design Reasonably and Focus on Residential Electrical Fire 


With economic development, people’s power demand increases, making the design of 
electrical circuits guaranteed in safety and reliability at least 20 to 30 years. In terms 
of wiring should be noted: it is much likely to set a fire when wiring with aluminum 
than doing with copper. The placing of lines should be the mean of concealed way 
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which is combined with both beauty and safety. Increasing the number of sockets to 
remove all kinds of security risks and set a reasonable number of residential branch 
circuits can ensure the quality of work. [10] Meanwhile, the high-power equipments, 
especially the heating equipments used for long periods in the residential basement 
when winter comes, should be set a single loop for each. Last but not least, setting a 
fire alarm device is also very important. 


5.2. Standard Installation and Adaption of the Electrical Circuits 


Creating a safe environment are positive measures to reduce electrical fires. Standard 
installation and alternation is the first step to cut off a fire from the source which 
greatly avoids the line aging, overloading and other hazards. This requires the 
electrical design, construction and management to work together. And the 
professional ability of the personnel should be strengthened with training and 
practicing. 


5.3 Regular Testing of the Existing Electrical Circuits 


Fire agencies should conduct regular testing for the electrical circuits. It is also 
necessary to establish regular electrical safety testing system. Regular testing should 
be managed by professionals to detect random access, damaging and aging 
phenomena which can effectively prevent the occurrence of fire accidents. 


5.4 Improvement of the Relevant Regulations and Management 


Although the system of residential design has been basically established, the laws and 
regulations of living in the residential basement are still not authoritative and aimed 
enough. Secondly, for this special residential environment, it lacks in effective 
management of organizations, and even can be regarded as a management blind spot. 


5.5 Popularization of the Electric Knowledge 


Electrical fires of residence are caused mainly by the weakness in electrical fire 
knowledge. All of the levels in government and community should provide useful 
accesses to knowledge of electrical circuit at the very beginning. The tenants need 
regular and long-term education of the proper use of electricity, such as posting a 
variety of easy-to-learn knowledge of fire illustrated, and organizing professionals to 
explain electric knowledge to the tenants in their tenements in order to promote safe 
use of electricity. 


5.6 Active Use of New Materials and New Technology 


In new construction projects, it is better to utilize more new materials and modern 
technology, such as the use of flame-retardant or fire resistant wires, cables, pipes 
with fire blocking materials, electrical leakages and overload protection with circuit 
breakers and so on. [11] 
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6 Conclusion 


The residential basements provide a shelter for more than 10 million residents in 
Beijing. Along with the development of Beijing, the status of the residential 
basements will play an increasingly critical role for population from other places, and 
therefore such basements require more staffs in relevant fields like electrical 
engineers to pay attention. What is more, the construction and renovation of electrical 
engineering of the residential basements calls for improvement of reliability and 
security urgently, as well as the supplement of the load for the long-term with 
reserving margin. 
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Abstract. In Switched Reluctance Motor, the basic control issue is production 
of ripple free torque. The presence of ripple in the torque leads to production of 
undesirable noise and undesirable vibration during the operation. The static 
characteristics along with the magnetization pattern of the individual phases 
dictate the amount of torque ripple during operation. Concept of Electronic 
Control approaches for dynamic torque ripples are extensively reviewed and 
presented. 
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1 Introduction 


Switched Reluctance Motor (SRM) has become popular in Industrial and domestic 
application due to its high torque to inertia ratio, high efficiency, low cost, variable 
speed operation and good dynamic response. The primary disadvantage of SRM is 
torque ripple in the generated torque, leading to acoustic noise and mechanical vibra- 
tion, which must be kept within the permissible limits. The torque ripple is not toler- 
able in Direct Drive applications. There are primarily two approaches for reducing the 
torque ripple; one method is to improve the magnetic design of the motor, while the 
other is to use sophisticated electronic control techniques. In magnetic design, 
the reduction in torque pulsations is obtained by changing the stator and rotor pole 
structures with some penalty on the motor torque. The electronic approach is based on 
optimizing the control parameters, which include the supply voltage, turn-on and turn- 
off angles, and current level. 

Firstly, the cause of torque ripple in SRM is mainly due to the switching of phase 
currents into its windings and the highly nonlinear nature of phase inductance varia- 
tion when the rotor rotates. When the successive phase windings are excited in 
sequence to produce continuous rotation, the total torque is the sum of the torque gen- 
erated due to the currents in the outgoing phase and the current in the incoming phase, 
which are controlled independently. Torque pulsations are encountered at these in- 
stants of current switching from one active phase to the other. From the following 
Equation (1), it is observed that if improvements in torque profile are to be achieved, 
the excitation current i and /or the phase inductance L (0, i) are to be modified. 
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T(6,i) = 1 j2 oL(6,i) (1) 
2 00 

Therefore, the torque depends upon the torque — current — angle characteristics of the 

machine. Secondly, the absence of reference frame transformation for SRM leads to 

dependency of rotor angle for torque production. 

Torque ripple is defined as as the difference between the maximum and minimum 
instantaneous torque expressed as a percentage of the average torque during steady 
state operation. Mathematically, Percentage Torque ripple is expressed as given in 
equation (2) 


Legend: 
—S— : Phase current=40 Amp. —— : Phase current=30 Amp. 


—>— : Phase current=20 A : : Phase current=10 A 


Torque produced (N.m) 


Rotor position (degree) 


Fig. 1. Torque — current — angle characteristics of SRM 


Timst (max)— Tymst (min) 


Torque Ripple (%) = (2) 


Ta ve 


Where T),s; is the instantaneous torque produced during every switching and T,,, is 
the average torque value. The torque ripple is evaluated from the torque dips in the 
static characteristics. Torque dip occurs at the overlap region during which the two 
overlapping phases produce equal torque at equal levels of current. When the incom- 
ing phase becomes deficient in supplying the required torque, torque dip occurs. The 
period of overlap is related to the no of strokes per revolution or step angle s. 
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21 
Step Angle ¢ = NN paNrep (3) 

Where N, is the number of rotor poles, Np, is the number of phases and N,.p represents 
the number of pole pairs per phase. A low value of step angle will increase the over- 
lap angle and also influences the frequency of control per revolution. Higher number 
of strokes would lessen the torque ripple with high value of rotor poles Nr which 
would decrease the rotor saliency, increase VA rating of the inverter, switching fre- 
quency and copper loss [1]. A lower torque ripple would improve the average torque 
produced. The number of strokes per revolution could be increased by larger N, with 
the penalty in the saliency ratio. In [3], the author had discussed about the choice of 
number of rotor poles. Increasing N, would increase the voltage ampere rating of the 
controller used. The core losses will also increase due to higher frequency switching. 
Torque ripple occur mainly in the overlap region as the torque production shifts from 
one phase to another. The value of overlap angle is given by the formula. 


2m 
N,N, (4) 


Overlap angle = min(,.B;) — 


As the amount of overlap increases with N, and N,, one would try to maximize these 
values. Maximising the value of N, and N, will lead to complexity in the converter 
design and would limit the operating speed of the machine. It also depends upon the 
stator pole arc Bs and rotor pole arc B,: 


This paper presents a detailed survey of various electronic strategies implemented 
for Torque Ripple Minimization. 


2 Electronic Control Approach 


2.1 Torque Sharing Function 


One of the efficient methods for minimization of dynamic torque ripples is to track 
the current / torque produced in every individual phase by a Torque Sharing Function 
(TSF). The TSF are function of turn on angle T,,,, overlap angle 0,,, turn off angle Ty 
and motor speed o. 


Hysteresis 
current 
controller 


Power 
converter 


Fig. 2. Torque Control using TSF 


The torque/flux/current controllers tracks the expected value of torque/flux/current 
based on TSF. With TSF, the SRM drive operates either on hysteresis or PWM con- 
trol. In order to maintain the desired instantaneous torque, a high bandwidth current 
regulator is need. The TSF defined could be linear or nonlinear. In the paper [3], a 
rising and falling exponential TSF, referred as m function relating to rotor position 0 
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has been used. The reference phase voltage is obtained from rotating frame transfor- 
mation technique under constant speed. The limitations in this technique are the as- 
sumption of magnetic linearity, constant speed operation and converter requirement 
for obtaining the spiky applied voltage. In the paper [4], Schramm et.al proposed a 
linear TSF in which the torque varies linearly during the commutation and the value 
of current in both incoming and outgoing phases are equal at the central commutation 
point. A high torque to current was obtained from this TSF. The nonlinear TSF like 
sinusoidal TSF and Cubic TSF were developed to obtain the smooth control and high- 
est torque to current ratios [5, 6]. Xue et.al has carried out a critical evaluation of all 
four TSFs for torque ripple minimization with rate of change of flux and rate of 
change of current to be the evaluation criteria [7]. He further proposed the Torque 
Ripple Factor (TRF), a performance index to evaluate the effectiveness of TSF for 
ripple reduction. Genetic Algorithm is used to optimize the all the four TSF for mini- 
mum TRF and concluded that the cubic TSF yields less in both TRF and computation 
time. Thus, it can be inferred that nature of function and its interdependency on rotor 
position decides the smoother and ripple less torque production. 


2.2 Current Profile Strategy 


From equation (1), it can be understood that the value of torque is dependent on the 
phase currents and rotor position and it is evident to control the phase current profile 
through wave shaping technique for reducing the torque pulsation. In the paper [8], a 
self learning technique in which an accurate value of reference current required for 
obtaining the desired torque for various rotor positions could be obtained by physical 
measurements and stored as a loop up table. The total time taken for computation and 
conduction of test is very high. In the paper [9], Lovatt et.al has described for the first 
time a method for finding the optimum current waveforms for a given power and 
speed for an SRM with limited supply voltage and limited peak current, using com- 
puter search techniques. A prior knowledge about the static characteristics of the mo- 
tor is required in LDT approach while determining the reference current [10, 11]. 
Reference [12] showed that the motor torque ripples could be effectively cancelled by 
injecting the harmonics of amplitude and spatial phase into the reference current at 
appropriate frequency. In the paper [13], the coefficient of added amplitude and spa- 
tial phases were found plotting surface of ripple variance. The surface was optimized 
for minimal value by Gradient Descent method in online process. It was found effec- 
tive in reducing steady-state torque ripple, given a feedback signal proportional to 
torque ripple, over the full speed range. In reference [14], the author designed an ef- 
fective PI controller for tracking the pre-optimized current references. A digital solu- 
tion (Microcontroller or Digital Signal Processor) for hysteresis control eliminates the 
use of current feedback filters and limits the bandwidth of the current control loop. 

As the developed in SRM is proportional to the square of the phase current, Nihat 
et.al presented a method for modulation of phase currents ier [15]. Such modulation 
leads to decrease in torque dip. Due to hysteresis current control, the system expe- 
riences subsonic noise which could be reduced by reducing the controller bandwidth 
and increasing the switching frequency. The above modulation strategy is imple- 
mented with Sliding Mode Control (SMC) for the speed control of the drive [16]. The 
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SMC used, adjust the i,.¢ and torque dips are eliminated with the modulation strategy. 
The low frequency vibration in torque has been eliminated by SMC. 

In the paper [17], the author proposed a nonlinear internal model control in which 
the current error is passed through LPF and is added to compensate for plant model 
mismatches. Sahoo et.al proposed an iterative learning controller (ILC) for solving 
tracking control problems in nonlinear systems with limited plant knowledge, where 
the task is performed repetitively. In the above proposed controller, a feed forward 
control scheme together with a proportional current controller was employed to im- 
prove the tracking performance [18]. The ILC stores the control input and the plant 
output error at each operation cycle, and the control input was updated according to a 
learning law. The learning law ensures that the error is reduced from cycle to cycle 
until the desired level of accuracy was achieved. In the paper [19], the author pro- 
posed a novel technique Fuzzy Iterative Technique (FIT) modulates the reference 
current waveform iteratively by the use of the multiplying factor determined by fuzzy 
systems using torque error and rotor position as the two inputs. The FIT computes the 
incremental change (correction term) for the current profile at each iteration. 

In the paper [20], the current profiling is carried out from flux linkage profiling or 
magnetization characteristics of motor using two dimensional B-spline neural net- 
works (BSNN). The system proposed with BSNN does not necessitate the use of high 
bandwidth current controller and a torque sensor for initial training of the network. 
Henriques et al. [21] suggested a new method for shaping the motor phase currents to 
minimize the torque ripple using a neuro-fuzzy compensator. In this method, a com- 
pensating signal was added to the output of the proportional integral (PI) controller in 
the current regulated speed control loop. In [22], the authors developed a fuzzy sys- 
tem for maintaining the speed constant by controlling the current waveform. The ref- 
erence current was then modulated by subtracting the output of the fuzzy system from 
the sum of phase currents computed at the previous sampling period. In [23], the au- 
thor formulated a multi-objective optimal design problem for generating the reference 
current using GA. The results show that the four design parameters can be automati- 
cally selected by GA and much smoother current waveforms are generated when 
comparing with conventional TSF design using heuristic knowledge. In paper [24], 
authors proposed the concept of context based emotional learning for computing the 
reference phase current for maintaining the speed constant and inherently reducing the 
torque ripple. 


2.3 Commutation Strategy 


One of the reasons for torque ripple in SRM is rapid switching of phase currents into 
its windings, the method of switching the currents are performed either at optimal turn 
on (7,,) or turn off (T,,) angle. In paper [25], Iqbal et.al, devised a hybrid controller in 
which the concept of balanced commutation algorithm is infused in TSF with the con- 
trol of Central commutation angle 8,, which is varied between two limits as function 
of speed. In [26], the magnetization duration depends on turn off (7,) angle, an of- 
fline calculation for optimum torque control at both chopping and full wave mode can 
be determined with an analytical model. In [27], the author proposed a compensation 
technique, by varying the value of turn off angle as a function of current and speed, 
using fuzzy logic control proved it to be effective in reducing the ripple. It was an off 
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line process and cannot be implemented for real time system due to difficulty in 
dynamic measurements. In paper [28], Fisch et.al presented the Pareto optimal opti- 
mization technique using GA for maximizing the value of turn on and turn off angles. 
The optimized firing angles were applied to an inverse model of SRM. 


2.4 Multiphase Operation 


The torque ripple produced in a unipolar operation is more. The total torque is the 
sum of sequential torque pulses produced by each phase alone which leads to exis- 
tence of quite large torque ripple. Multiphase excitation of Switched reluctance motor 
with a special accent on three phase operation would certainly reduce the torque rip- 
ple [29]. Chris et.al analyzed the distribution of magnetic forces with multiphase exci- 
tation. Bipolar Excitation has produced more number of Short Flux Path Excitation 
(SFPE) which improves the average torque produced and reduces the core loss, vibra- 
tions and acoustic noise in the motor [30]. The Bipolar excitations to the phase 
winding improve the current profile and reduce the total harmonic distortion (THD) 
by 26.77% [31]. In a two phase excitation model discussed in [32], the torque ripple 
minimization is carried out using Fibonacci and exhaustive search methods. It was 
observed that Fibonacci and exhaustive search yielded closer results and ripple reduc- 
tion is nearly 50% compared to the traditional constant current scheme. The Fibonacci 
search requires less storage space and it is preferred. 


3 Conclusion 


In this work, a review of the ripple reduction strategies in SRM is presented. It is ob- 
served from the study that the torque control should be on instantaneous basis for dy- 
namic torque ripple reduction. The multiphase operation accounting mutual inductance 
and extension of Random Pulse Width Modulation (RPWM) in SRM are in progress. 
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Abstract. Cameras’ miniaturization enable us feasible to capture photos and 
video at anywhere and anytime, even we wear them on human body and look the 
world in egocentric point of view. However, image quality degradation is caused 
by various factors in wearable environment, preprocess image acquired in 
wearable vision system is an unavoidable step. In this paper, we propose a new 
scheme to preprocess image sequence according to different quality of image 
frames in contiguous sequence. Firstly we detect image quality using hybrid 
methods, then classify them into some categories and given corresponding pre- 
process policy. We present experiment on a dataset collected within a cluttered 
environment, a prototype implementation validates its validity. 


Keyword: wearable vision, preprocess image, blur detection, motion blur. 


1 Introduction 


Two technology’s development drive the emerging wearable vision[1],[2],[3]. One is 
the cameras’ miniaturization, so we are feasible to capture photos and video at any- 
where and anytime. The other is development of wearable computing, so we easily 
wear the cameras on person’s body and looking out at the world. 

Capture high quality in wearable environment is a hard and challenge task, because 
the quality of image via wearable camera is relate to the motion of body, camera and 
human body’s movement easier cause image quality degradation. There are many 
image quality problems are caused by the constantly and arbitrarily moving wearable 
cameras. Some sample images with degradation were acquired by head mounted 
cameras is showed in Fig. 1. 


Fig. 1. Acquired image quality problems via wearable cameras: image motion blur, magnifying 
scaled face with out of focus; excessive light. 
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The decrease of quality image even let the further image analysis doesn’t work, 
object detection or recognition all need good quality image. Though the wearable 
camera could be consciously controlled by the user in reality application, for instance, 
key parts of the body well-positioned may help shoot good object and may help sim- 
plify vision problem. But it is notice that for continuous video sequences, most standard 
technique of object detection is simply to treat each frame as a still image, sometimes 
an object found in frame n+1 is the same object found in frame n, if the object doesn’t 
move or detect wrong object, that means the system will keep on detecting and con- 
suming vast resources all the while. So in resource-limited wearable system, we should 
consume fewer resources as effectively as possible to extract more useful information 
from the best quality image in frame sequence. Thus the key question is how to fast 
detect best quality image in frame sequence as showed in Fig.2. 


Fig. 2. Four continuous video frames captured from wearable camera, the question is which one 
is the best quality we should prior process? 


In this paper, we propose a new schema for pre-processing image in wearable vision 
system, the purpose is help resource-limited vision system to select the best quality 
image and repair lower quality image. Considering motion blur is the common problem 
in wearable computing, so we only pay close attention to motion blur in our first step 
work. 

The organization of the remainder of this paper is as follows: the next section de- 
scribes the schema design and pre-process strategy, then implementation of crucial 
module is presented in following section, mainly introduces relate works of blur de- 
tection and the special solution method for blur identification in wearable system, 
simulation section presents an example use case using the proposed schema and me- 
thod. Finally, the conclusion and outlook provides summary and directions for ongoing 
and future work. 


2 Schema Design 


In this section, we designed series requirement policies for image preprocess in 
wearable vision system, the objective of which is to consider all preprocess requirement 
and select the best quality image for preprocessing different image according to dif- 
ferent quality. The whole requirement policies are showed as follows: 


1) When image is captured by wearable vision system, do image graying and blur 
detection, If image is no blur detected then enter the queen of image process, oth- 
erwise calculate the extend of blur and automatically sorted by blur extend. 
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2) According to application requirements and computational power, system will 
preset threshold of blurred images, the threshold will filter blurred image into 
different process policy: direct handling, handling after repair and cannot be 
repaired. 

3) Little blur image could directly handling in the pending queue and was given priority 
2, repairable image should do image inpainting and was given priority 3 in pending 
queue. Unrepairable image should adopt discarding, saving or sending to server 
policy according to application requirement. 

4) All the images on the queue are in order of priority and waiting for the exposure and 
light balance of pre-processing methods, if in the previous the same scene of frame 
has been get recognized results, the remaining images in the queue as needed to save 
or discard. 
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Fig. 3. The framework with process module of image quality selection policy. 


According to above preprocess strategy, we have designed the pre-processing 
framework (see Fig.3) of selecting the best quality image. Through this framework we 
can capture the image and divide them into four categories: high-quality image without 
blur, good quality image with little blur, repairable low quality image and unrepairable 
bad quality image. In wearable vision system, motion blur is the most common de- 
gradation, so the key step of preprocess execute image blur detection. 
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3 Image Blur Detection 


3.1 Previous Work 


Motion blur is a common image degradation phenomenon in digital cameras, so many 
technologies have been proposed to enhance the quality of image, for instance, stabi- 
lized lens and shift-CCD/CMOS are popular and expensive hardware solution, and 
other software approaches for blur detection are proposed in recently years. Tone et 
al[4] proposed a scheme that makes the use of the ability of Harr wavelet transform in 
different types of edges. Reeves S et al. [5] simplified the identification problem by 
parameterizing the PSF and described a parameterized blur estimation technique using 
generalized cross validation, but computation complexity is high. Rooms et al. [6] 
assumed the PSF can be modeled with a single parameter, and they used a Gaussian 
function to estimate the single-parameter PSF from a single observation of a degraded 
image. Fergus et al. [7] adopted a variational Bayesian approach to estimate the PSF of 
an image. Chong et al [8] proposed a method based on the analysis of extrema values in 
an image. Hsu et al [9] propose a blur detector based on support vector machines to 
estimate the blur extent of an image. Ko et al [10] detect the blurred image by con- 
structing the Bayes disciminant function[11] with the statistics of the gradients of the 
input image. 


3.2 Proposed Method 


In real application of wearable vision, wearable camera’s motion relate to body’s 
movement, so usually more than one degradation function affects the image blur. we 
hope system could fast detect the quality of image frame, so in this section, we propose 
a low-cost and simple approach using the statistics of image gradients to detect blur and 
classify image sequence into different prior label class[9][10]. 

In this paper, we calculate Magnitude of Gradient using first order derivative of 
Gaussian with 2-D Gaussian kernel. When Sigma=1.0, outputs Fig.4, which showed 
the comparison of good quality image, horizon motion and vertical motion blur image 
in original RGB image, gradient magnitude of the corresponding image and corres- 
ponding histogram image. Comparing Fig.4-A2, B2 and C2 and Fig.5., we intuitively 
notice that once the image is blurred, its amount of gradients is decreased and the mean 
and standard deviation are also decreased. Fig.3-A3, B3 and C3 shows the histogram of 
gradients image, we notice that the gradient magnitude distribution of a blurred image 
is almost empty on the large values and there only exists small values. This suggests 
that the statistics of the image gradients can be used for detecting the blurred images. 

In Fig.5, we present the distribution of comparison of Sigma=1.0 and Sigma=0.2. 
Obviously, blurred images have smaller the mean and standard deviations than the 
sharp in this distribution, and smaller sigma distribution could be easier classified. With 
these statistics, we use the Bayes linear discriminant function [10][11] to detect images 
for two class: blurred and no blur images. 
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Fig. 4. Comparison of good quality, horizon motion blur and vertical motion blur images in 
corresponding gradient magnitude and histogram image. 
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Fig. 5. Distribution of the statistics of the gradient magnitude: no blur image, horizon motion 
image and vertical motion image. (left) sigma=1.0 (right) sigma=0.2. 


4 Simulate Experiment 


As mentioned in section 2.1, we use 100 blurred images and 100 sharp images captured 
by wearable vision system as the training data, see Fig.6. From this training data, we 
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can calculate the mean vector and the covariance matrix of the input vector x=[stdev 
mean]' for the blur and no blur classes, and apply[11] to detect the blurred images. We 
prepare 125 test image sequence frames (see Fig.7) to examine the proposed method 
and the result of accuracy rate is about 80.8%. 


Fig. 5. The training data with (1st row)different direction blur images and (2th row) no motion 
blur images. 


Fig. 6. The test image sequence with different scene were captured in wearable vision system. 


5 Conclusion 


In this paper, we propose a new schema for pre-processing image in wearable vision 
system for the goal of selecting the best quality image and repairing lower quality 
image. Considering motion blur is the common problem in wearable computing, so we 
realize the motion blur detection in our first step work. We have provided experimental 
results showing the proposed scheme is efficient. A future extension of our work will 
be to optimize the blur detection classifier and stabilize the motion blur images. 
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Abstract. Wrapper roll is one of the most important components in rolling coi- 
ler. And it has great relation to the quality of the final products. Based on the 
programming and optimization experience, a hot strip coiler with new coiling 
process, which operates with position control and pressure control is intro- 
duced. Control method and operational mode are given and auto jump control is 
described specially. The configuration of the system will be more compact. It 
acts slightly, obviously reduces dynamic loads and noises, and improves the 
surface quality of strip. the wrapper roll control system is introduced in detail. 


Keywords: hot strip, wrapper roll, AJC control. 


1 Introduction 


With the large demand of hot strip in the market, high quality of the hot strip becomes 
more important. Down Coiler area is the finished product area of the hot strip area 
and has great influence on the quality of band steel. Wrapper roll is the most impor- 
tant parts of the whole system. The high precision of position and force control of 
wrapper roll control system enhances the quality of band steel wrapping. 


2 Control Method 


There are two control methods in wrapper roll control. One is position control which 
is called CPC (constant position control), and the other is pressing force control, 
which is called CPR (constant pressure rolling). 


2.1 Position Control 


Wrapper roll (WR) position is controlled by comparing WR gap setting with the ac- 
tual gap. The actual gap is detected by rotary type Manescale which is installed at 
pivot of wrapper arm. The WR gap setting is converted to rotation angle of WR arm 
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in the DCPC unit. DCPC unit calculates the deviation between WR arm rotation angle 
setting and WR arm actual rotation angle. The deviation is multiplied by the control 
gain. And then, DCPC unit outputs as servo valve opening amount command. The 
illustrated scheme is as followed. 


DCPC unite 


WR arm rotation 
/ angle deviations 
GAP - 
WR gap settinge OE ANGLE aA cPC control 
(oulss) > Igains 
WR angle actual values 


(MAGNESCALE pulse) 


Servo valve openings 
amount command+ 


Fig. 1. Position control 


2.2. Pressing Force Control 


Pressing force control is illustrated in figure 2. WR pressing force is controlled by 
comparing WR force setting with the actual force. The actual force is detected by 
pressure transducers which are installed at cylinder head side and rod side. The WR 
force setting is converted to cylinder pressure considering WR position and WR arm 
weight. in the DCPC unit. DCPC unit calculates the deviation between WR pressure 
setting and actual pressure. The deviation is multiplied by the control gain. And then, 
DCPC unit outputs as servo valve opening amount command. 


DCPC unite 
WR pressure 
deviatione 
: + PR control Servo valve openings 
WR force setting+ FORC : -_ as pening 
3 7. [gains amount commande 
WR pressure actual value+ 
(Pressure transducen+ 


Fig. 2. Pressing force control 


3 WR Control System 


3.1. AJC Control System 


AJC (automatic jumping control) control system consists of PLC portion and D-CPC 
unit portion. PLC outputs the reference and the command to D-CPC unit and D-CPC 
unit consists of position control loop (CPC) or pressing force control loop (CPR). 


3.2 Position Control Loop 


Wrapper roll gap position control system compares the feedback with the reference in 
the unit of wrapper roll rotation angle converted from wrapper roll gap. Gap reference 
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from PLC gap reference is made in the DCPC unit during AJC etc. is converted to the 
reference of wrapper roll arm rotation angle. The other hand feedback signal is ob- 
tained by the D-CPC unit by counting pulses from rotary MAGNESCALE installed at 
the wrapper roll arm. The between reference and feedback is multiplied by the differ- 
ence control gain “G”’. and then this multiplied value is output to servo valve via servo 
amplifier as servo valve opening amount command. 

The above mentioned position control and pressing force control mentioned below 
are done every Imsec in the D-CPC unit. 

The above mentioned is fundamental position control function, in addition to this 
there are speed compensation function and null compensation function as auxiliary 
function. Speed compensation function is the function of make the response faster and 
to prevent vibration of wrapper roll by detecting rotation speed of wrapper roll arm and 
outputting some open or close amount command as compensation to servo valve. 

Null compensation function is the function to improve the wrapper roll gap setting 
error caused by null point shift of the servo valve. The difference between the refer- 
ence and the feedback are input null compensation circuit and from this circuit the 
some open or close amount command as compensation is output to servo valve. 


3.3 Pressing Force Control Loop 


When pressing force control is done, pressing force reference and pressure of head side 
and rod side of the wrapper roll cylinder are converted to the pressing pressure of head 
side. The difference between the reference and the feedback is multiplied by the con- 
trol gain “G” and then this multiplied value is output to servo valve via servo amplifier 
as servo valve opening amount command. 

Straight pressing force direction of the wrapper roll cylinder is different from the di- 
rection of wrapper roll toward the mandrel. In other word the direction changes by 
wrapper roll gap. Therefore wrapper roll pressing force reference conversion to cylind- 
er pressure reference (the pressing pressure of head side) is done considering wrapper 
roll gap in the D-CPC unit. 

Pressure transducers installed at head side and rod side of wrapper roll cylinder are 
used as feedback signal. However rod side pressure signal is converted to head side 
pressure considering the difference of cylinder area of head side and rod side. And the 
cylinder pressure feedback is obtained as the pressure converted to head side pressure. 


4 Jumping Control 


4.1 AJC Control 


Automatic jumping control is carried out to minimize the top mark of the strip. The 
system is illustrated in figure 3. The top mark occurs when the strip head end hits 
against the wrapper roll. Therefore control concept of automatic jumping control (AJC) 
is as follows. 

The wrapper roll jumps (start position control based on step reference) before strip 
head end reaches to the wrapper roll. After the strip head end passes through the wrap- 
per roll position, wrapper roll goes down towards the mandrel and starts pressing force 
control (CPR). The jump timing and the pressing force control timing are made by strip 
head end tracking function. 
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Fig. 3. AJC control 


4.2 AJC Control Process 


When the strip head end breaks both laser sensor beams (in normal operation, both 
OS and DS laser are used for detecting the strip head end), the strip head end tracking 
for AJC is started to generate wrapper roll jump and press timing signal for each 
wrapper roll. 

The pulse signal from PLG attached to the bottom pinch roll is used for this track- 
ing. Wrapper rolls start jumping and pressing in accordance with the timing signal 
generated as above. One example of the wrapper roll operation during the strip coiling 
with AJC (JUMP) is given. When the strip head end breaks both laser sensor beams (in 
normal operation, both OS and DS laser are used for detecting the strip head end), the 
strip head end tracking for AJC is started to generate wrapper roll jump and press tim- 
ing signal for each wrapper roll, same as AJC selected. (The strip head end tracking is 
performed, and wrapper roll jump and Press timing signals are generated even though 
AJC selection is off.) 


4.3 Jump Timing and CPR Timing Compensation 


For prevention of the WR to hit against the strip head end at AJC coiling, wrapper roll 
must reach at preset jump position before the strip head end reaches hit point which is 
calculated in SH.93. The Jump timing and CPR timing are calculated based on the 
following formulas. 


Lyi=Voer*Tit¥3=Ver(TjpitATji)+7¥3 [mm] (1) 
Loi=Y2 [mm] (2) 

Relation between jump amount and time to jump is as the following formulas. 
Ty=T pit AT HTC +oji) [sec] (3) 


Where T},; is time to reach preset jump position under the condition, that serve valve 
is not in full open, aj; is coefficient of jump timing compensation at aturation of 
serve valve, AT; is time compensation, Vpp is bottom pinch roll speed at HMD 
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Fig. 4. Jump timing and CPR timing compensation 


on timing, y3is jump start compensation length(preset in AJC panel) and y2is CPR 
start compensation length(preset in AJC panel). 

WR calibration is conducted to match the actual gap between WR and MD and the 
WR gap actual value memorized in the AJC control panel. 

WR calibration operation is required in the following cases: After starting up the 
AJC control panel (power on) including the replacement of the control unit and the 
position sensor; After starting up the PC in the AJC control panel (running the PC 
from its stop status); After replacing WR; After replacing MD and MD segment. 

WR gap is controlled using WR arm rotation angle as feedback signal. Therefore, 
converting WR gap to WR rotation angle or WR rotation angle to WR gap are neces- 
sary for WR position control. The formulas for conversion are as follows. 

WR gap to WR rotation angle (pulse) conversion is as followed. 


Lit rig -(PM*PM Lg) 
@=cos' : a (4) 
2xLixL2 


0 
Magnescalenpulse = ——x100000 
: 360 ©) 
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WR rotation angle (pulse) to WR gap conversion is as followed. 


_ Magnescalepulse 
100000 


x360 
(6) 


Dm+Dw 
2 (7) 


MAGNESCALE# 
UI (.o0000pulseitey) 


G=,|L? +L2? -2xL1xL2xcos (0+ a) - 


Fig. 5. WR gap to WR rotation angle conversion 


Where Dm is Mandrel diameter, Dw is WR diameter, G is WR gap, 8is WR arm 
rotation angle from WR gap “0” and ais WR arm rotation angle (WR gap “0”) based 
on position of center of mandrel 


5 Conclusion 


The hot strip is drived by cylinder that has fast response speed and high control preci- 
sion. It help the wrapper roll operate effectively and enhance the quality of band steel. 
The control system has been one of the most important functions of the whole control 
system. 
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Abstract. Digital certificate is the key element that implements trust and trust 
authentication in e-government and e-commerce, however CA may be canceled 
and revoked in advance by accident in practical applications. Therefore it is 
critical for the users of this certificate to obtain the latest certificate status as 
soon as possible, it is also critical for realizing credibility in PKI system. It 
summarizes the advantages and disadvantages in PKI system, and discusses the 
difficulties in practical application by analyzing the OCSP protocol. Some 
corresponding methods are proposed to solve the above problems. 


Keywords: PKI, digital certificate, certificate revoke, OCSP. 


1 Introduction 


As e-commerce and e-government continued popularity to ensure transmission of 
digital information security, in addition to adopting stronger encryption algorithm, the 
need to establish trust and mechanism of trust authentication, that the parties in partic- 
ipating e-commerce must have an identifier that can be verified, which is a digital 
certificate. Digital certificate is unique, and its public key associates with the entity 
itself. The authentication of certificates solves the security issues in online trading and 
settlement, Including the establishment of e-commerce trust relationship between the 
subjects, namely the establishment of safety certification authority(CA) and selection 
of secure protocol (such as SET, SSL)[1]. CA issues Public key certificate to user 
which has expiration data. However, in many cases the validity of the certificate may 
be invalid before the expiration data. For example: certificate subject name changing, 
the relationship of CA and the subject of certificate changing (such as an employee 
and his organization end the employment relationship), the private key of the leaked, 
destroyed or lost and so on. In these cases, the certificate must be canceled and re- 
voked, Then the user of using this certificate should as soon as possible to obtain the 
latest certificate status which is critical for realizing credibility in PKI system. 


2 The Theory of OCSP 


Nowadays, the revocation of PKI system is widely implemented by CRL. A general 
description of CRL is in [2]. This system works as the issuance of certificate does, 
which means the information without trust of correspondence and service system 
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needn’t encryption. This method is easily implemented and is widely adopted because 
of its low requirements of system resources. But, the shortage of this method is ob- 
vious. Because CRL is issued at regular intervals, if one certificate is abolished, the 
message of revocation has to be apprized in the coming cycle. Meanwhile, because of 
the existence of all information of revocations, CRL will become very huge with 
more and more revocation information. With the increase of the size of CRL, the 
verification period will be longer, if the latest CRL is continuously downloaded, it 
will suffer great network losses and result in CA server inefficiency and network 
congestion, which keeps clients from getting CRL information. On account of the 
periodicity of information issued by CRL, time delay between certificate being re- 
voked and clients getting message occurs. The creditability of PKI is confronted with 
great challenges because of the time delay. Some special discussion on improving 
methods is presented in [3] and [4]. OCSP can get the information whether the certifi- 
cate has been revoked in time. As the supplement of CRL, the application can test 
“revoked” state on account of OCSP. OCSP allows the application to determine the 
“revoked” state of a certificate. Contrasting to CRL, OCSP can meet many require- 
ments and provide quicker “revoked” message and other states information. A general 
description of OCSP is in [5].The two main advantages of OCSP are: efficiently re- 
ducing the network load and providing timely state information of a certificate for 
clients. Chart | describes the interaction between clients and OCSP Responder. 


Status of 
cer A 


Fig. 1. Interactive process of OCSP 


The protocol of OCSP is an easy request/response protocol, which provides on-line 
revocation by means of the trusted third party. An OCSP client sends a request for 
state detection to an OCSP responder and keeps the certificate in receiving order until 
receiving a response from Responder. A request of OCSP consists of a protocol 
version number, type of service request and one or more certificate identifiers. A 
certificate identifier consists of identifying names of the certificate authority organiza- 
tion, the hash values of shared key of the certificate authority organization and serial 
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number of the certificate and additional expansion. Response consists of certificate 
identifiers and certificate states, namely, normal, revoke or unknown. If a certificate is 
in a revoked state, the exact time and the reason for revoking are required to describe. 


OCSPResponse::=SEQUENCE { 


responseStatus OCSPResponseStatus, 
responseBytes [0] EXPLICIT ResponseBytes OPTIONAL } 
OCSPResponseStatus ::= ENUMERATED { 
successful (0), —Response has valid confirmations 
malformedRequest (1), — Illegal confirmation request 
internalError (2), — Internal error in issuer 
tryLater (3), —Try again later 
— (4) is not used 
sigRequired (5), —Must sign the request 
unauthorized (6), —Request unauthorized 


3  OCSP Analysis 


OCSP can provide timely and latest information for certificate state, but it doesn’t 
mean the response from OCSP is zero-delay contrasting to the current state of certifi- 
cate. OCSP protocol does not give a clear definition on the background database for 
collecting the revoked information. As Chart 1 shows, the background of OCSP still 
adopts CRL and other similar means for collecting the revoked certificate informa- 
tion. The real-time performance of OCSP Responder is determined by the time-delay 
of gathering the information. Therefore, we cannot simply think OCSP responder can 
update information automatically and provide timely service. 

OCSP response must be guaranteed by digital signature from a trusted party and it 
is not tampered during transmitting process. The third party signature may be a certif- 
icate authorized by CA or an entity admitted by CA and clients must acquire the cer- 
tificate copy of the public key. In order that a proper responder is conveniently 
achieved by clients, authoritative information access extending field of X.509 shared- 
key certificate indicates the address of OCSP Responder, which means the localities 
of OCSP Responder are attached to certificate. 

Because of the adoption of request-responder, OCSP has no need to distribution of 
CRL and it can eliminate the distribution limitations of CRL; Because every respond- 
er from OCSP is smaller than CRL, OCSP responds to information model that relies 
on C/S model, which supports more clients; OCSP adopts HTTP, LDAP and so on, 
which travels the protocols of TCP/IP, so the configuration and realization of OCSP 
can be conveniently achieved; Because the requester makes information requests for a 
particular certificate not an invalid certificate list, OCSP provides more effective 
solutions; In OCSP the time of sending request and the responding of showing certifi- 
cate valid can gain the resisted denial to trades history. 

In spite of the mentioned advantages, OCSP in nature relies on CRL, which results 
in some disadvantages. OCSP Responder gets the revoked information of certificate 
through the periodical CRL; If plenty of requests are submitted to the Responder in a 


314 X. Deng et al. 


definite interval, OCSP Responder will easily be attacked by DoS. And the Responder 
need to sign for responses, which consumes the vast majority of CPU time; because 
the Responder does not sign for returned false notifications, attackers would fake 
returned false notifications to attack. The protocol indicates certificate state informa- 
tion comes from Responder, but it does not definitely indicate how to get certificate 
state information. As for clients of OCSP Responder, the best approach to getting the 
information is that CA directly transmits it to Responder. According to the relation 
between CA and OCSP Responder, certificate authority (CA) organization can for- 
ward revoked information of information and then offers it to clients immediately. 
However, in practice most Responders gain revoked information of certificate through 
periodical CRL. 


4 Solutions to the Problems 


On the basis of the above analyses if a reply that meets a demand of real-time state 
needs to be realized, that is to say providing a CRL that has not been stored, it is from 
a real-time certificate storage. Allow Responder offers high performance extended 
information without CRL. In abstract terms, the responder is providing an implemen- 
tation of an authenticated dictionary that responds to membership queries from rely- 
ing parties. A conventional OCSP responder answers the question "Is x excluded from 
D?", while an improved OCSP responder answers the question "Is x present in 
D?"[6]. When returning a response, the responder only indicates that request certifi- 
cate that is present in its valid certificates, a definite verified list that does not verify 
certificate in any way and operates directly on CA certificates storage. 


4.1 Some Improving Methods 


OCSP defines a complex certificate identifier as a part of certificate. Some parts are 
transformed by Hash and some are not, and even need to regard datum of other 
certificates as a part of identifier, which makes it very difficult verifying a unique 
certificate. The main goal is providing an identifier that is easily and widely adopted 
by all certificates, and thereby neglects their model and coding. Meanwhile, the new 
certificate identifier can be compatible with traditional OCSP certificate identifier. 
Therefore, the definition of new identifier can be adopted by new protocol as follows 
to extend existent OCSP identifiers: 


NEWldentifier ::= SEQUENCE { 
certHash OtherHash, 
legacyID IssuerAndSerialNumber OPTIONAL  } 


SHA-1 arithmetic is adopted by certHash to hash certificate, which applies fingerprint 
system or other similar systems. And this operational and correct verification is easily 
achieved. Here Othterhash adopts shalHash ::= OCTET STRING SIZE(20) directly 
or other methods in [7]. LegacyID provides feasible methods that is based on tradi- 
tional CRL. Its complete definition makes reference to [8]. This identifier can easily 
develop from X.509 and it can be widely applied in CMS and S/MIME. Regardless of 
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whether this identifier applies a traditional method or in an uncertain state, this iden- 
tifier can be adopted. And this method can be run in constrained environments. 

Because of its CRL-based origins, OCSP can only return a negative response. For 
example, when fed a freshly-issued certificate and asked "Is this a valid certificate", it 
can't say "Yes" (a CRL can only answer "revoked"), and when fed an Excel spread- 
sheet it can't say "No" (the spreadsheet won't be present on any CRL). This problem 
interacts badly with the one mentioned above, so there is no way to confirm what the 
actual problem is. The second major design goal of OCSP then is to provide a clear, 
unambiguous response to any query, either "This certificate is definitely valid right 
now", "This certificate is definitely not valid right now", or "The object you have 
queried doesn't exist"( The standard OCSP cannot perform the operations mentioned 
above.). The new one can apply basic and extended responses. The definition of basic 
response is as follows: 


NewResponseBasic ::= SEQUENCE { 
certHash OtherHash, 
status BOOLEAN, 
extensions Extensions OPTIONAL — } 


A returned value 'true' indicates that the certificate is valid right now. A returned 
value ‘false’ indicates that the certificate is not valid right now. This is a clear, unam- 
biguous response that is useful for the clients who want to know definitely whether 
they can safely use it or not. Relying parties who require further information should 
use the extended response. The definition is as follows: 


RESPONSEINFO ::= CLASS { 
&status CertStatus UNIQUE, 
&StatusInfo OPTIONAL 
} WITH SYNTAX { &status [WITH DETAILS IN &StatusInfo] } 
NewResponseExtended ::= SEQUENCE { 
certHash OtherHash, 
status RESPONSEINFO.&status({ CertStatus }), 
statusInfo RESPONSEINFO.&StatusInfo({ CertStatus }{ @status }), 
extensions Extensions OPTIONAL _ } 
ResponseTypes RESPONSEINFO ::= { 
{ statusOK } | 
{ statusRevoked WITH DETAILS IN RevocationInfo } | 
{ statusSuperseded WITH DETAILS IN SupersededInfo } | 
{ statusUnknown }, 


} 
CertStatus ::= ENUMERATED { 
statusOK (0), 
statusRevoked (1), 
statusSuperseded (2), 
statusUnknown (3), 
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The consistency of statusOK value with the basic response shows certificate valid. If 
the certificate has been revoked or rendered invalid in some form, the responder will 
return a "revoked" response. The definition of RevocationInfo is as follows: 


RevocationInfo ::=SEQUENCE { 
revocationTime RelativeTimeInfo OPTIONAL, 
revocationReason CRLReason OPTIONAL — } 


RevocationTime indicates the time at which the revocation or invalidation took place, 
if available. RevocationReason provides the reason why the certificate was revoked or 
rendered invalid, if available. 

OCSP uses timestamps for all responses, assuming that the relying party and res- 
ponder somehow have perfectly synchronized clocks. This is almost never the case, 
with systems having been encountered with clocks that are as much as decades out of 
sync. The new method does not rely on synchronized clocks for its operation. 

This method eliminates the need that the responder and relying party are in need of 
precisely synchronized clocks. The relying party may use the absolute revocation time 
if they have a mechanism for precise clock synchronization with the responder, or the 
difference between the two times to determine how far in the past relative to its own 
clock the revocation took place. The definition is as follows: 


RelativeTimeInfo ::= SEQUENCE { 
localTime GeneralizedTime, 
timeValue GeneralizedTime } 


5 Conclusion 


This paper solved the problems of the excessive complexity of certificate identifier of 
OCSP, unclearness and inaccurateness of response, an overreliance on responder and 
the disadvantages of synchronized clocks. And the new protocol can be applied better 
in some resource-constrained environments because of these improvements. The 
adoption and operation of these improvements can also solve the existent problems of 
a real-time certification querying mechanism, which will be useful for the develop- 
ment and adoption of PKI. We only accept references written using the latin alphabet. 
If the title of the book you are referring to is in Russian or Chinese, then please write 
(in Russian) or (in Chinese) at the end of the transcript or translation of the title. 
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Abstract. To improve the retrieval rate of contourlet texture image retrieval 
system, a contourlet-2.3 transform based texture image retrieval system was 
proposed. In the system, the contourlet transform was contourlet-2.3, a new ver- 
sion of the original contourlet, sub-bands absolute mean energy and kurtosis in 
each contourlet-2.3 sub-band were cascaded to form feature vectors, and the 
similarity metric was Canberra distance. Experimental results on 109 brodatz 
texture images show that using the features cascaded by absolute mean energy 
and kurtosis can lead to a higher retrieval rate than the combination of standard 
deviation and absolute mean energy which is most commonly used today under 
same dimension of feature vectors. Contourlet-2.3 transform based image re- 
trieval system is superior to those of the original contourlet transform, non- 
subsampled contourlet system under the same system structure with same 
dimension of feature vectors, retrieval time and memory needed. 


Keywords: content based image retrieval; contourlet-2.3 transform; texture im- 
age; retrieval rate; contourlet transform; non-subsampled contourlet transform. 


1 Introduction 


With the fast development of internet and all kinds of imaging system technology, 
image resources are expanding more quickly than ever, the classic retrieval approach- 
es using keywords can not describe the visual characters in the query image such as 
color, texture and contour. To overcome the difficulties of keyword retrieval systems, 
a new type of retrieval system called content-based image retrieval (CBIR) system 
was proposed[1],[2]. In the CBIR system, before retrieval work, every image in the 
image database which will be retrieved should be represented with a feature vector, 
all the vectors should be placed together to form a feature vector database, that is, a 
feature vector is used to represent a real image and is linked to the corresponding 
“true image”. When a query image is input, the retrieval system will extract its fea- 
tures to form a feature vector which is used to compare the similarity between the 
query vector and each vector in the database, the N number most similarity vector will 
be chosen as the retrieval result. The most important technology of the CBIR system 
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is feature matching which includes three aspects: some certain transform (e.g. wavelet 
transform), feature extraction and similarity measure. During the past ten years, wave- 
let transform has played an important role in the system due to its good characters of 
multi-scale and local time-frequency [3], [4], [5]. Yet, some disadvantages of wavelet 
transform including shift sensitivity and the lack of directionality limits its abilities in 
texture representation. To overcome the deficiencies of wavelet transform, researchers 
have developed many improved approaches, such as: ridgelet, curvelet, beamlet, con- 
tourlet, bandelet, etc. In the family of “X-let”, contourlet transform (CT) [6] is more 
acceptable greatly because of its easier implementation and strong ability in direction 
information representation. Since the transform was proposed in 2002 by Do, several 
modified versions have been proposed and form a new family including non- 
subsampled contourlet transform (NSCT), semi-subsampled contourlet transform [7], 
and a sharper frequency localization contourlet version [8], etc. Non-subsampled 
contourlet transform which was proposed by Cunha in 2005 has higher shift insensi- 
tivity level than the original contourlet transform but has higher redundancy as 
described by (1), where S denotes the scale number of the transform. The high redun- 
dancy makes the transform much more time consuming and much larger memory 
needed. 


Ss 
Re=1+ 2’. (1) 


s=l 


To overcome the limitation of high redundancy, Cunha presented a compromise trans- 
form which was a cascade of non-subsampled Laplacian pyramid and critical subsam- 
pled directional filter banks, and made the redundancy fall to S+1, Here we call the 
transform Contourlet-S, and CTS as abbreviation. To further reduce the redundancy of 
the transform, and aliasing, Lu and Do proposed a modified version of the original 
contourlet transform including three different variants and their redundancy ratio are 
approximately 2.3, 1.6 and 1.3, respectively. For convenience, we call them contourlet- 
2.3, contourlet-1.6 and contourlet-1.3. In [8], Lu and Do announced that contourlet-2.3 
performs better than the other two versions in image de-noising application. In this 
paper, we will use contourlet-2.3 to implement a new retrieval algorithm. 

Ever since the contourlet transform was proposed, many literatures reported the 
application approaches in many different areas including CBIR systems [9], [10]. But 
the original based retrieval system has very limited retrieval rate due to the drawbacks 
as mentioned before. 

On the other hand, all the literatures as we know use absolute mean energy and 
standard deviation of each sub-band coefficients as features, which will be shown has 
low retrieval rates. Here, in this work, we will use absolute mean energy and kurtosis 
as features. 

The remaining parts of this paper are organized as follows: key techniques of con- 
tourlet-2.3 texture image retrieval algorithm will be covered in section 2, experimen- 
tal method and results will be shown in section 3 and in the section 4, the last section, 
we will conclude the whole paper and give some future work directions. 
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2 Key Techniques of Contourlet-2.3 Texture Retrieval Algorithm 


The key technologies of contourlet-2.3 texture image retrieval system include con- 
tourlet-2.3 transform, feature vectors construction and distance measure. So we will 
introduce them separately in this section. 


2.1 Contourlet-2.3 Transform 


We will review the contourlet-2.3 transform in this section to explain why we choose 
it to implement our retrieval system. 

The original contourlet transform is constructed as a combination of Laplacian Py- 
ramid (LP) and directional filter banks (DFB), where the LP iteratively decomposes a 
2-D image into low pass and high pass sub-bands, and the DFB are applied to the high 
pass sub-bands to further decompose the frequency spectrum. Using ideal filters, the 
contourlet transform will decompose the 2-D frequency spectrum into trapezoid- 
shaped regions. Due to this cascade structure, multiscale and multi-directional de- 
composition stages in the contourlet transform are independent of each other. One can 
decompose each scale into any arbitrary power of two’s number of directions, and 
different scales can be decomposed into different numbers of directions. This feature 
makes contourlet a unique transform that can achieve a high level of flexibility in 
decomposition while being close to critically sampled (up to 33% redundancy, which 
comes from the LP structure). 

Experiments have shown that the original contourlet transform has poor time- 
frequency localization character which leads to severe artifacts and aliasing exist in 
the recovered images. Lu and Do proposed a new design based on the basic structure 
of the original contourlet transform to enhance the localization of the contourlet basis. 


In their design, a new LP structure which is composed of Di(w) and 


Li(w)Gi=0,1,2...) replaced the LP structure in the original CT. The DFB in the new 


design uses the same one as the original transform. Depending on the different sub- 
sampled matrix in LP-like structure, the redundancy of the new contourlet transform 
is different. The choices for the subsampled matrix (d, d) maybe (1,1), (1.5,1.5) 
or (2,2 ), and the redundancy of the new version will be about 2.3, 1.6 and 1.3 
respectively. 

It is notable that the new LP-like structure considered the practical reasons which 
lead to the deficient time-frequency localization, the new transform can avoid most 
aliasing and artifacts problem which widely exists in the first CT de-noising method. 
We communicate with Mr. Yue Lu through E-mail and appreciate to get their CTSD 
toolbox. Experimental results show that the new transform has better localization 
basis than that of the original version. According to different redundancies the retriev- 
al results are different, roughly speaking, higher redundancy results in higher retrieval 
rate. In this paper, we use contourlet-2.3 to implement our retrieval system. 


2.2 Construction of Feature Vectors 


Many methods have been used to construct feature vectors including energy com- 
bined with standard deviation, generalized Gaussian model and co-occurrence matrix, 
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here we use the absolute mean energy value and kurtosis of each contourlet domain 
directional sub-band. 

For a sub-band in contourlet domain, we can use (2) to calculate its absolute mean 
energy, and the kurtosis used here is defined as (3), where s and k denote the index of 
scale and direction, M,N stand for the row and column number of the sub-band coef- 
ficients, W is the coefficient of row M and column N in sub-band indexed by s and k, 
uu andorepresent mean and standard deviation, respectively. 


1 M N 


m=l n=l 


gD ere ; > sak UM, re ii sk) : (3) 


Each feature vector is constructed by cascading the energy value and kurtosis of each 
contourlet domain directional sub-band. For every image in the database which will 
be retrieved, a certain feature vector can be obtained and then is put into the feature 
vector database as the signature of the corresponding image for retrieval. 


2.3 The Determination of Similarity Measure 


The similarity measure is used to calculate the distance between different feature 
vectors. Up to now, at least there are 10 different types of distance measure, they are: 
Manhattan (L1), Weighted-Mean-—Variance (WMV), Euclidean (L2), Chebychev (L), 
Mahalanobis, Canberra, Bray-Curtis, Squared Chord, Squared Chi-Squared and Kull- 
back Leibler. Kokare compared the nine measures except Kull-back distance (KLD) 
and declared that Canberra and Bray-Curtis are superior to others [11], and we com- 
pared Canberra and Kull-back distance, the result is that Canberra is more suitable in 
such kind of situation. So in this paper, we directly choose Canberra distance as dis- 


tance measure. The Canberra distance is defined as (4), where d(x, y) means the 
distance between vector X,y , D denotes the dimension of the feature vectors, 


X;, y; are the i-th components of X and Jy, respectively. 


d(x,y) = eae Al (4) 


racy 


3 Experiment and Results 


In this section, we will introduce the implementation approach of the contourlet-2.3 
texture image retrieval system, and evaluate the retrieval rate of the algorithm. Fur- 
thermore, we will study the factors which influence the retrieval rate and how to im- 
prove the retrieval rate of the system. 
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3.1 Experimental Objects 


The experimental objects are the 109 texture images come from Brodatz album [12]. 
For each 640x640 pixels image, we cut them into non-overlapped 16 sub-images and 
each one is 160x160 pixels size, then we can obtain an image database with 
109x16=1744 sub-images. The 16 sub-images come from the same original image 
can be viewed as the same category. 


3.2 Experimental Approach 


The experimental approach can be divided into 4 steps: 


Step 1: For each sub-image in the database, we used contourlet-2.3 to transform it into 
contourlet-2.3 domain. In contourlet-2.3 domain, for each image, we calculated the 
absolute mean energy and kurtosis of each directional sub-band using (2) and (3), 
respectively. Then we cascaded them together as the feature vector of that image. 
Choosing the decomposition parameter as [4 3 3] means that the numbers of direc- 
tional sub-bands are 16, 8, 8, adding the low frequency sub-band, the number of 
sub-bands are 33, each sub-band needs two parameters to describe, so, for every sub- 
image in the database, the dimension of feature vector is 66. Using the same method 
for every sub-image, we can extract 1744 feature vectors altogether. All the feature 
vectors were put together into feature vector database. 


The following steps used to evaluate the performance of the retrieval system. 


Step 2: Select the first sub-image in the database, using (4), calculate the Canberra 
distance between its feature vector and every one in the feature vector database. Then 
find the N=16 nearest images as the retrieval result. Examine how many images be- 
long to the corresponding group, and divided the value by 16 to get the retrieval rate; 


Step 3: For next image feature vector in the 1744 sub-image vector database, using 
the same method as in step 2, calculate the average retrieval rate R, and repeat the 
procedure until all the feature vectors have been processed. 


Step 4: For N € {16, 20, 30, 40, 50, 60, 70, 80, 90, 100}, repeat step 2 and 3, calcu- 
late the average retrieval rate for each N. 


Step 2 to Step 4 can be described by formula (5) as follows, where q=1744, R(p) 
denotes the average retrieval rate for each p© {16, 20, 30, 40, 50, 60, 70, 80, 90, 
100}, hence 10 retrieval results can be acquired. S(p, i) is the number of images be- 
long to the correct group when the i-th image used as query image. 


1 1 ; 
RP) =D RDI 


f) 
q i=l 16 


3.3 Experimental Results 


Using the above approach, we can get the average retrieval rate of contourlet-2.3 
texture image retrieval system as shown in table 1. 
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It should be noted that CO stands for contourlet-2.3 with the combination feature 
of absolute mean energy and standard deviation which is used widely in wavelet and 
contourlet-like situations as in (6) while CN means contourlet-2.3 with the combina- 
tion feature of absolute mean energy and kurtosis which was proposed in this paper. 
The row named “av” means the average retrieval rate of the corresponding coloum. 

In table 1, we compared the retrieval rates and some other systems including the 
original contourlet transform (CT), and non-subsampled contourlet transform (NSCT, 
here use NT for short) under the same structure and decomposition parameters. In the 
experiment, for CT condition, we used “pkva” and “9-7” bi-orthogonal wavelet as 
DFB and LP filters, respectively. For NSCT condition, the “maxflat” and “dmaxflat7” 
were used as LP and DFB filters, respectively. For contourlet-2.3 (CT23) transform 
“pkva” were used for DFB filters. 


1 M N 
OCS.) = [DDI Wea mort) ba PD (6) 


m=1 n=1 


From table 1 we can see that no matter what decomposition parameters were selected, 
contourlet-2.3 retrieval system always has higher average retrieval rate that CT sys- 
tem, especially under small N values. Comparing with NSCT, CT23 has some little 
privilege. It should be noted that, CT23 has significant lower redundancy than NSCT, 
hence needs much less time in building the feature vector database, which is a wel- 
come character at present time due to too fast expanding image database. 


Table 1. Comparison of different texture image retrieval algorithms (%) 


[4 3 3] [3 2 2] [3 32 2] 


N CT | NT | CO; CN | CT | NT | CO | CN | CT | NT | CO | CN 


16 | 68.3 | 71.4 | 67.3 | 71.2 | 70.0 | 71.0 | 68.2 | 70.8 | 70.6 | 72.0 | 69.1 | 72.5 
20 | 74.2 | 76.7 | 72.5 | 76.2 | 74.9 | 75.9 | 73.5 | 76.1 | 76.1 | 77.4 | 74.5 | 78.2 
30 80.5 | 81.3 | 78.1 | 81.9 | 80.1 | 80.8 | 78.7 | 81.3 | 81.6 | 81.9 | 79.5 | 83.1 
40 83.5 | 84.1 | 81.2 | 85.0 | 83.1 | 83.6 | 81.3 | 84.2 | 84.1 | 84.2 | 82.2 | 85.6 
50 85.7 | 86.2 | 83.0 | 87.1 | 85.1 | 85.6 | 83.1 | 86.3 | 86.0 | 85.8 | 84.1 | 87.5 
60 87.0 | 87.6 | 84.4 | 88.5 | 86.8 | 87.2 | 84.5 | 87.8 | 87.5 | 87.1 | 85.8 | 88.9 
70 88.1 | 88.7 | 85.7 | 89.7 | 88.0 | 88.3 | 85.7 | 88.9 | 88.7 | 88.1 | 87.1 | 89.9 
80 89.1 | 89.7 | 87.0 | 90.6 | 89.0 | 89.2 | 87.0 | 89.8 | 89.7 | 89.0 | 88.4 | 90.7 
90 | 90.0 | 90.4 | 87.9 | 91.4 | 89.8 | 90.0 | 88.0 | 90.5 | 90.6 | 89.8 | 89.3 | 91.5 
100 | 90.6 | 91.0 | 88.8 | 92.1 | 90.6 | 90.7 | 89.0 | 91.1 | 91.3 | 90.5 | 90.2 | 92.2 
av 83.7 | 84.7 | 81.6 | 85.4 | 83.7 | 84.2 | 81.9 | 84.7 | 84.6 | 84.6 | 83.0 | 86.0 


Table | show that decomposition parameters including the scale number and direc- 
tional sub-band number have great influence on the retrieval rates for all the four 
retrieval systems. With the increasing of scale number, the retrieval rate tends rising, 
and with the increasing of directional number, the average retrieval rates tends falling. 
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For the former, the increasing of scale number means to describe the same image 
from more different distances, but the scale number should be limited so that the ap- 
proximation image is no smaller than 10x10 pixels for robust reason. For the latter 
phenomenon, the reason rises from the texture images of this database are not rich 
enough texture information. 

We can also find that the features what we use can make significant effect to re- 
trieval rates. Even using the original contourlet transform with absolute mean energy 
and kurtosis features can perform better than contourlet-2.3 with absolute mean ener- 
gy and standard deviation features. Of course, contourlet-2.3 with absolute mean 
energy and kurtosis features is superior to CT and NSCT situations. 


4 Conclusion 


A contourlet-2.3 based texture image retrieval system was proposed in this paper 
which utilized the CT23 combined with the Canberra distance and the features includ- 
ing absolute mean energy and kurtosis of each sub-band coefficients. The new 
retrieval system has higher retrieval rate than absolute mean energy and standard 
deviation under same structure and same dimension of feature vectors. The new algo- 
rithm has higher retrieval rate than the original contourlet transform and non- 
subsampled contourlet transform under same structure. 

We will further improve the retrieval rate through the means including new trans- 
forms, features and similarity measure. 
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Abstract. This paper presents a new predict demonstration approach for robot 
soccer path planning under complex and uncertain environment. By predicting 
and analyzing the future position and attitude of concerned object, the position 
and attitude of the object is controlled by demonstration algorithm. The pro- 
posed method is successfully used in the obstacle avoidance and is realized on 
the MiroSot 3vs3 simulating platform. Experiment results show that this algo- 
rithm has good real-time ability and adaptability to environment. 


Keywords: obstacle avoidance, predict control, path planning, robot soccer. 


1 Introduction 


Robot soccer game is full of intense competition where accurate collision-free path 
planning is one of the most important challenges. During the realization of path plan- 
ning, both the dynamic behaviors and tendency of the robots and obstacles should be 
considered. The changing state of robots will affect the state of obstacles, which will 
accordingly affect the state of robots. As a result, it is quite difficult for path planning 
since both robots and obstacles have dynamic characters. Recently, there are lots of 
methods for robot path planning such as artificial field algorithm, genetic algorithm, 
neural network, adaptive control6, etc[1-3]. It seems that none is adaptive to all cir- 
cumstance as each has its advantages and disadvantages. Furthermore, in a robot 
soccer game, successful obstacle avoidance and rapidness are both of great impor- 
tance. This paper presents an approach based on predict demonstration method by 
predicting the uncertainty of the size, direction of the movement of the obstacle [4] 
and then using demonstration method to controlled the object. The proposed 
algorithm is successfully applied in the obstacle avoidance by the MiroSot 3vs3 simu- 
lating platform. Results show that this algorithm has good real-time ability and adap- 
tability to environment. 


2 Demonstration Algorithm 


The game field is divided into several zones as shown in fig.1. Each zone is corres- 
ponding to a value which is defined as aggressive coefficient (AC). The larger value of 
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AC means more successful attacking probability at that very point. If we divide all 
behaviors of the robot at any point into one zone, then each robot will have an availa- 
ble zone. So how to choose an optimal point becomes to the key of this problem. Ob- 
viously, it is not efficient to predict, estimate and calculate each point concerned and 
the speed limitation of each wheel makes the problem worse. For example, the robot 
usually has the following behaviors: (1) Directions: front and back; (2) Speed: includ- 
ing 4 levels as left or right. So the speed combination of points that the robot can arrive 
is 2*4*4=32, and the available distance of each point is different. Therefore we choose 
the optimal speed combination of the left and right wheel from the 32 kinds of speed 
according to the dissimilar circumstances in the game field every period. 


Fig. 1. Demonstration algorithm sketch 


According to the evaluation of each point, we judge the weight value of the point 
and select the point which has the highest weight value as the objective of the move- 
ment. Available zone and evaluation factors assure the reachability of the concerned 
point so we need not to discuss more conditions. Assume t=80ms as a computer 
command period: 


(1) Concerning available zone: 


. orientations of each robot of our side, which influence the speed selection; 
. coordinates of all robots; 


(2) Concerning objective: 


=» — with obstacles or not, including the robots of the opponent and our own; 

=» — on the sideline or not, namely outside or not 

= _ the position of the ball relative to the robot; 

=» controllability of the ball, namely whether the attacking robot is approach- 
ing the ball; 

« the defending effect on the opponent member; 

= the importance coefficient of local zone; 

= attitude after movement, namely whether toward the goal or the next action 
to take; 

1. the amount of robot in each zone; 


From above we choose the max value. The assignment of weight values can be meas- 
ured by the importance of factors at each point, and the weight values of those points 
that the robot can't reach are defined zero. As long as a certain point is chosen, the 
corresponding speeds of the left and right wheel are ready to use. 
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3 Predict Demonstration Algorithm 


The circumstance of robot soccer game is complex, uncertain and changing rapidly, 
however the robot members’ action is much limited. So how to reasonably plan the 
restricted acts and obtain the efficiency is important. This following part discusses a 
predictive approach to decide the robot actions by the prediction of the effects based 
on demonstration algorithm. This model has the characteristic of high robustness, 
high efficiency and high antagonism. 

The main Agent's decisions are determined by the camera above the field which 
captures all information about the robots and the ball in the entire course of the game. 
The concerned information can be regarded as static in some relative time, while the 
trajectory of the robots and the ball is dynamic and continuous. So we must predict 
the future positions of the robots and the ball in order to succeed in obtaining the 
optimal path. As fig.2. shows, where AJ, A2, and A3 means respectively positions of 
the target at the former time(t-7), current time(t), next time (t+7T), whent > t+T (Tis 


the sample period of computer), the motive distance of the target is A, A, . Since the T 


value is very small, there is A,A, = A,A, approximately. So we can get: 


OA, = OA, + A,A, = OA, + AA, (1) 
0 x 
A AR 


Fig. 2. Inertia prediction sketch 


Inertia prediction algorithm is introduced as follows by a robot pursuing the ball 
example (as shown in fig.3.). Suppose that the current speed of the ball (size and 
direction given) and the position of robot R are known; Bt is the current position of 
the ball; Bt-7 is the former position of the ball; nT is the predictive time. Where 7 is a 
constant obtained from experiment, T is sample period. After calculating we can get: 


(1) The position of the ball after nT time, namely the coordinate of point Bt+nT; 
, the speed V’ of R can be ob- 


t+nT 


[nt (2) 


Obviously, the smaller n is, the smaller nT is, and the predict approach is more ac- 
curate. However, there is no inertia to the ball if m is too small, and there is an in- 
crease to step length which may cause overshoot and larger error if n is too big. So it 


(2) Assuming R is in regular speed, calculate [RB 


tained from Bt to BttnT as V’ = [RB 


ttnT 


Fig. 3. The predict method diagram 
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is very important to adjust appropriately the value of n. In addition, whatever n is, all 
the values obtained from predict method have an error margin and need continuous 
improvement. The prediction and demonstration approach are combined as predict 
demonstration algorithm (PDA). 


4 Simulation and Results 


The mirosot3vs3 experiment plat is provided by the robot soccer competition BBS for 
the hardware system simulating and decision training. Users can program their own 
decision-making system based on this platform. The develop environment is Win- 
dows2000 VC++6.0. The PDA has been applied in robot soccer obstacle avoidance. 
One case is considered about the simulating experiment: one robot of the own team, 
one robot of the opponent team and the ball in the match field. The simulating result 
is shown in Fig.4. As the obstacle position (here is the opponent robot which in regu- 
lar speed) are known at the current time and the former time, the position of the ob- 
stacle after nt time can be predicted. 


Target object 


Fig. 4. Simulation result of robot obstacle avoidance with PDA 


5 Conclusions 


With PDA, the computer predicts and analysis the position of the target in future time 
and uses demonstration approach in path planning. Experiments and simulating results 
show that the robot path designed by DDA is more preferable and rapid. Using the pro- 
posed approach to plan the robot track is equal to change the static circumstance into 
dynamic one, so it has good real-time ability and adaptability to environment. Experi- 
ments show that this algorithm can be applied in penalty kick and holding actions. 
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Abstract. SNR gain for programmable amplifiers is analyzed in this paper. The 
results show that programmable amplifier with amplifier-array (PACAA) has 
higher SNR gain compared with programmable amplifier with single Op amp. 
SNR gain of PACAA is over 20dB and direct proportional to the number of 
amplifiers connected in serial. The input signal of the amplifier is divided into 
two parts according to the frequency, one of which is signal if its frequency is 
below a certain value and another is noise if its frequency is over the certain 
value. To the same design requirements, PACAA not only has good SNR gain 
but also reduces demands for the O0dB bandwidths of Op amps. However, the 
integrated circuits with PACAA will increase the area of the dice. It is needed 
to consider how to obtain higher SNR gain and minimize the chip size. 


Keywords: programmable amplifier, SNR, gain. 


1 Introduction 


Variable gain amplifier can be divided into two types in engineering technology. One 
is automatic gain control amplifier (AGC) [1] [2], and the other is programmable gain 
amplifier (PGA) [3] [4] [5]. In signal detection and measurement systems, PGA is 
generally used as the front-end in measurement or data acquisition circuits for the 
gain control of analog input signals to meet the requirements of data acquisition and 
measurement [4]. 

There are two ways to achieve PGA. One is programmable amplifier constructed 
with an Op amp [4], and the other is programmable amplifier constructed with 
amplifier-array (PACAA). The gains of two kinds PGA are both programmable, but 
their technical features and design methods are relatively different. The different are 
embodied in the parameters of 0dB bandwidth requirement for Op amp and the SNR 
gain corresponding to the same amplification factor. SNR gains of programmable 
amplifier with single Op amp and PACAA are discussed in this paper. Results of the 
study show that PACAA has higher SNR gain, which is benefit for reducing the 
output noise of the amplifier. 

In this paper, frequency characteristics of the Programmable amplifier with single 
Op amp and PACAA are discussed in part 2; in part 3 and part 4, SNR gains of both 
kinds of programmable amplifiers are analyzed and discussed, respectively. 
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2 Programmable Amplifiers and Amplifier Array 


2.1 Frequency Characteristics of Non-inverting Amplifier Circuit 


The frequency response of the Op amp is directly related to the connection states of 
the circuit. Fig. 1 shows the closed loop feedback circuit of the Op amp. It is known 
that the Op Amp circuit can run steadily under the negative feedback state condition. 
Fig. | is a non-inverting amplifier with negative feedback structure. 


Fig. 1. Closed loop negative feedback structure of Op Amp 


In Fig. 1 A,(s) = a, /(1+ Ss /p,) is the open loop gain of the Op Amp, 4, is the 


low frequency amplification factor of open loop of the Op Amp, p, is the main pole of 


the Op Amp, and F is the negative feedback function. The transfer function of Fig.1 is 
H(s) = V,(s)/V,, (S) = Ag(s)/(1+ Ao (s)F) (1) 


Under the condition of the deeply negative feedback, A, =a, /(+ a,F) ~1 /F, wel 
let p, =p,(1+a,/A, ), therefore 


H, (8) =A, /(L-+s/p.) Q 


To non-inverting amplifier there isF =1/(1+R, /R), therefore, 


HO) Ay) (0 [rsas/(1+Be))) <a, tsps) ) 


where p, = p; (1+a,/(1+R,/R)) =p,a,/(1+R,/R), so 
Po =Piao/(1+ Ry /R) = P,ao/Ay (4) 


2.2 Programmable Amplifier with Single Op Amp 


A basic architecture of programmable non-inverting amplifier is shown as Fig.2. In 
Fig.2, A, is formed by connecting different feedback resistors. 


A non-inverting amplifier can be described with equation (2) and the amplitude- 
frequency characteristics are shown in Fig. 3, where 20dB thick dashed line 
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represents A, =10 and thick solid line represents A, = 100,A, =20lga,, f, and 


m 


f pi are input signal bandwidths and the frequency corresponding to the main pole of 
the amplifier, respectively. For the Op amp, Af is a constant. Therefore the unit-gain 


bandwidth is Te =a) (i.e. OdB bandwidth). For example, let maximum 


pl 


frequency of the signal is f, 


m =20MHz, and the amplifications of non-inverting 


amplifier with single Op amp are 10 and 100, then the Op amp's unit-gain bandwidth 
iS 20x10° x100 = 2GHz , ie. the OdB bandwidth of the Op amp has to be 


fy 2 2GHz. 


20dB/10oct 


Soi Sin 10fin 10°fr=fo 


Fig. 2. Non-inverting PGA with single Op amp Fig. 3. Bode plot of Fig.2 


From Fig. 3, suppose the low frequency amplifications of PGA are Ay, and Avy ; 
and A,, <Aj,,,, the 3dB bandwidths are from Ts to fs , then 


F=(Aus/Ata) fo (5) 


From equations (2) and (5), it can be known that there must be Te = he < can for the 


amplification A, , because the same Op amp is used. Equation (5) shows that if the 


programmable amplifier is designed with single Op amp, it must be taken full account 
of the requirements for the unit-gain bandwidth of the Op amp during it work with 
maximum voltage gain. 


2.3 Gain Programmable Amplifier Array 


Fig. 4 is an example of a programmable amplifier constructed with amplifier-array. It 
can be seen from Fig. 4 that the voltage gain of the amplifier depends on the 
connection of amplifiers. When the switches K1, K3 and K5 are closed, the voltage 
amplification factor is 20, while K1, K2 and K4 are closed, the amplification is 100. 
Among PACAA, the amplifications of the amplifiers are fixed, so their 3dB 
bandwidths are fixed. Since each level of magnification do not have to be large, 
therefore, the requirements for each Op amp's unit-gain bandwidth will be significantly 
reduced. Suppose the circuit with amplification factor 100 is realized by two amplifiers 
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with the magnification of 10 in series, and the maximum frequency of the input signal 
is fy=20MHz, then 20x10°x10=200MHz. This shows that the unit-gain bandwidth of 
the Op amp should be fy>200MHz. This means that the requirement for the Op amp’s 
unit-gain bandwidth is reduced 10 times compared with Fig. 2,. 


=H 
Vin R K2 
R 
” OR: go Yo 
atm Ky Kl 
KS oo R 
R 
OR 


Fig. 4. Amplifier-array 


Let A,, and A;, are low frequency gains of the two series amplifiers, Ay,A;,=Ar, 
and A,, <A,,. Meanwhile, assuming two amplifiers have the same unit-gain 


bandwidths (OdB bandwidth) Tos Le. foi=fpi»- 3dB bandwidth of the amplifier with 
largest gain is f[,=f, and fii<fta<fo. The transfer function of the amplifier in Fig. 4 is 


A.A 
Hea aaa (6) 
(1+8/p,, )(1+8/P2,) 
A+B 
B 
A 


Sola fn Sia oa 


(a) Different gaia (b) The same gain 
Fig. 5. Bode plot of the PACAA 
The Bode plot of equation (6) is shown in Fig. 5. The thick dotted line in Fig. 5(b) 
represents two amplifiers with the same magnification. It can be seen from Fig. 5, due 


to the unit-gain bandwidth of the amplifier is effectively reduced, the programmable 
amplifier has a better inhibitory effect to the out-band noise mixed in the signal 


3 SNR Gain Analysis of Programmable Amplifier Constructed 
with Single Op Amp 


Assuming the input signal is s(t)=s,(t) + n(t) , where the effective bandwidth of 
s,(t) is tas n(t) is the out-band noise mixed in the signal whose minimum 


frequency is greater than f,, . The SNR gain of an amplifier is defined as 
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D = 201g|SNR, /SNR,, (7) 


where SNR of the input signal is SNR,, =S,(s)/N,,,(S) and SNR of the output 
signal is SNR, =S,,(S)/N,(S) . For the PGA shown as Fig. 3, the Laplace 


Transform of the output signal Y, (S) is 


A A 
Y, (s) = ——— IS, (s) + N(s)] = A, S,(s)+ ————N(s)__ (8) 
1+s/p, 1+s/p, 
In Fig. 3, 3dB bandwidth is f, corresponding to A, , and f, 2 f,,can be met at 
any magnification. Let input noise is N(S)=N,,o(S)+N,,(s), where, N,.o(S) is 
the noise between f,, < fi and N,, (Ss) is the noise between f, < f,,. Therefore the 


output noise can be divided into two parts: 


1) For the noise between f, S f, , because it is in the pass band of the amplifier 


(Fig. 3), the output noise of this part is 
N,, (8) =A, N,o(s) (9) 


2) For the noise between Ty: < to , because it is outside the pass band of the amplifier, 


the output noise of this part is 


N,o(S) = ss) (10) 


— Lt ON 
1+s/p, 
Then, the output noise of the amplifier is N,(S) =N,,(S) + N,, (S). According to 
equation (7), the SNR of the output signal is 


S 
1+— 5 (s) 
SNR, = Po ie (11) 
4 SNno(S)_ | Ni, (5) 
p,N;, (S) 
Above all, SNR gain of the programmable amplifier with single Op amp is 
N 
D= 201g (1+ |—201g|1 + SNaols) (12) 
P2 P2N;, (S) 


Considering Pp, = p,ay/A,, setS = j@=j27f ,p,=27f,, and substitutes them 


into equation (13), equation (14) is received. 
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; xe 
D = 2011 + A+! — 201g + AND (13) 


Frito FritoNin(f) 


The graph is shown in Fig. 6. 


7 
ae 


20leh+-2— = 


Fig. 6. SNR gain of programmable amplifier with single Op amp 


Equation (13) shows that for programmable amplifier with single Op amp: 


1) If the noise within the pass band of the amplifier N 9 (S) is equal to 0, 
ie. f,, = f.- the SNR gain is: 


p= 201g + AL (14) 
pito 
Equation (14) shows that SNR of the output is larger than that of the input. 
2) If all the noise is in the pass band of the amplifier, 


D = 201g) + A“ |~ 201g + ZA+] = 0 (15) 


pito pito 


It shows that the SNR is not improved. 
3) If neither of N 0 (s) and Ne (s) is equal to 0, the SNR gain is smaller than 20. 


There is little improvement in low frequency noise but more obvious improvement in 
high frequency noise. This shows that higher SNR gain can be obtained in the case of 
higher gain in low-frequency while a smaller SNR gain can be obtained in the case of 
smaller low-frequency gain. Both cases significantly inhibit the high-frequency noise 
but the low-frequency noise suppression is not obvious. 


4 SNR Gain of the PACAA 


Consider the PACAA in Fig. (4), it is composed by two amplifiers in series (where 
the amplifier 3 is always included in any combination). Using equation (5), equation 
(16) is received. 
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Ay Ar 
(T5/p.,)(1+5)P) 


In order to compare with programmable amplifier with single Op amp, let 
A.A =A;andA,, <Aj,,, meanwhile, both of the Op amps have the same 


Y,, (8) = [S,(s)+ N(s)] (16) 


unit-gain bandwidth (0dB bandwidth), f,, < fi, < foand fy, = Son - 


Assuming f}, = > the effective signal in the output is 


S, (s) = A, S,(s) (17) 
The noise in the output is 
A,,A 
= i —___N. (s) (18) 
(I+s/p,.)(1+5/P>) 
Similarly, assuming 
A, Ars 
[N,,(S)+N,,(8)] (19) 


 (5)p.,)(+9Px) 


Considering f,, = fi, < fia < So. it is received like this: 


1) For the low gain amplifier which is in the input port, considering there is noise 
in the pass band, from equation (9) and (10), the output noise of first amplifier is 


1+sN_,,(s)/(p,,N,, (s)) 
1+s/p,, 


N,, (8S) = Na (8S) + N,.9 (S) = Ay, Je (s) (20) 


2) For the high gain Op amp, there is no noise in the pass band, 


A Lb 


—_—"_ -—-N _(s 21 
1+s/p,, a8) oP 


ob 


Substitute equation (20) into (21), the noise in the output is 


A,,A,, { 1+sN N, 
N, (8) =N,,(s) = Suse | LF SNao (8) MPa Ni) |x (gy (22) 
1+s/p,, 1+s/p,, 
With complex frequency form, SNR gain is 
A,,N 
D=20lg|l+ VA + 201g {1+ Ay — 201g /1+ 4 PAN D) uNuolS J (23) 
tab ab lab“ ab Fiav2oav Nin (P) 
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Compared with equation (14), there are two positive 20dB in equation (23). The Bode 
plot is shown in Fig. 7. It can be seen that SNR gain of PACAA is 20dB larger than 
that of programmable amplifier with single Op amp, and higher SNR is received. 


faa | 


f lab 9 
At, 


labo 


—~ 20Igll + 


Fig. 7. SNR gain of PACAA 


5 Conclusion 


SNR gain is one of the basic requirements for signal conditioning circuits in signal 
measurement systems. This paper points out that PACAA has higher SNR gain 
compared with programmable amplifier constructed with an Op amp. In normal 
conditions, when other design requirements are the same, PACAA reduces requirements 
for unit-gain bandwidths of Op amps besides the better SNR gain. Therefore, it 
makes the amplifier design easier. However, compared with programmable amplifier 
constructed with an Op amp, PACAA will increase the area of analog circuits in 
integrated circuits design. Therefore, it is required careful analysis to obtain the lowest 
possible chip area under conditions of high SNR in relevant integrated circuits design. 
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Abstract. In this paper, the system’s verification testbench is established 
according to the functional requirements of a mixed-signal SoC system. The 
testbench tests the system processor’s control on the analog signal input 
circuit’s magnification and ADC’s sampling frequency. The simulation results 
with Modelsim indicate that the SoC System has achieved the expected 
functional requirements. And the testbench validates the accuracy of the SoC 
design well, reducing the risk of tape-out. 


Keywords: mixed-signal SoC, system function, testbench. 


1 Introduction 


Owing to the high performance, high integration and high complexity, the cost of the 
chip’s design and taping-out is also high. Therefore, the system’s function must be 
verified to ensure the correct function of SoC chip system before taping-out or placing 
and routing. In the whole process of the SoC chip design, the proportion of simulation 
and verification is increasing, and the logical inaccuracy is the main reason to cause 
the failure of SoC chip design and taping-out. Therefore, using the advanced design 
and simulation methods is the key to the success of SoC chip design. This can not 
only reduce the risk of SoC design taping-out and the cost, but also greatly reduce the 
SoC chip’s development cycle [1]. 

Currently, SoC verification methods can be divided into four main categories: 
verification method based on simulation, static verification method, formalization 
verification and physical verification. Static verification method is to verify the timing 
information of the specific circuit; physical verification method is applied to the 
physical layer of chip design. Therefore, SoC system verification methods are mainly 
verification method based on simulation and formalization verification [2]. And 
which is commonly used is software and hardware co-verification method based 
testbench. 

In this paper, a software simulation testbench for a mixed-signal SoC system is 
established, which is used to carry out system’s simulation verification to ensure the 
realization of its functional requirements. In part II, the structure and functional 
requirements of the mixed-signal SoC system are described. Part III provides the 
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simulation testbench of the mixed-signal SoC system, and explains the specific 
method for testing its function. And the results of simulation are given in part IV. 


2 A Mixed-Signal SoC System 


2.1 System Structure 


The system which is needed to establish testbench is a complex circuit mixed with 
digital-analog circuits, and is composed of some modules such as a set of analog signal 
input circuits, ADC, processor system, and so on. The structure is shown in Fig.1. 


Synchronizing Clock 


: Serial Port 
Analog Signal |_,. apc |—»|Processor | _____» 
Input Circuit ' System 


Fig. 1. Structure of a mixed-signal SoC system 


As Fig.1 shown above, the system's basic function is to complete the acquisition of 
the analog signal and control the amplifier and ADC by processor system. 
Specifically, the analog signal into the system first passes an analog signal input 
circuit with a magnification, then goes into ADC converting analog signal into digital 
signal. The digital signal output from ADC is stored into the processor system. At the 
same time, the processor system can control the selection of the analog input circuit 
magnification, and the sampling clock of ADC. 


2.2 Functional Requirements 


The functional requirements of main modules are as follows: 


Analog Signal Input Circuit: amplify the input analog signal. The circuit includes a 
set of amplifiers with four different magnifications which are controlled by four 
switches to choose the different amplified paths. Users can choose to set an amplifier 
with a magnification in four according to the actual need. 

ADC: achieve to convert analog signal into digital signal, and the conversion rate is 
controlled by the processor system. 

Processor System: realize the control of the analog signal input circuit’s magnification 
and the control of ADC’s sampling frequency, receive or transmit data through the 
serial port. 

Serial Port: achieve data communication with the outside. 

Synchronous Clock: provide the control signal of the mixed-signal SoC system. 
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3 Testbench Design 


This mixed-signal SoC system is a complex circuit with digital-analog circuits, and 
each module is a function IP core in the simulation. After completing the simulation 
of each IP core, it has to establish the system’s simulation testbench. The testbench is 
mainly to test the processor system’s control on the analog signal input circuit’s 
magnification and ADC’s sampling frequency, detecting whether the system has 
realized the functional requirements. 


3.1 Structure for Test 


Each module in the system has to generate the corresponding IP core, and the digital 
signal generator is used to replace the analog signal input circuit and ADC. The whole 
structure used for test is shown in Fig.2, and the description of each module's function 
is shown in Table 1. 


; Processor 
Signal Generator System Uart Test 


Programm Design 


Fig. 2. Structure for test 


Table 1. Description of each module's function in Fig.2 


Name Function 

Signal ; 1 SAE 

Goneeaior Provide digital signal to Processor System 
Store digital signal, and control Signal 

Processor see < 

Syst Generator which is used to replace amplifier 

ae and ADC 
Transmit and receive data With Processor 

Uart test ; é BY, 
System to achieve the serial communication 
Achieve the functional verification of the 

Baseiaiik system through programs, mainly including 

Dasiow the acquisition of the input signal and the 
control of Signal Generator which replaces 
amplifier and ADC. 


3.2 Testbench 


The established testbench is mainly to test the processor system’s control on the 
analog signal input circuit’s magnification and ADC’s sampling frequency. The 
structure and the associated control signals are shown below in Fig.3 and Table 2. 
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clk_ade rxd_t 


ade_start txd_t Uart Test 


amp [3:0] 
+—[Prosram desisn] 


adc_out [7:0] 
Fig. 3. Structure of system testbench 


clk_adc |}<——— 
adc_start |<——— 
amp [3:0] | «—— 


adc_out [7:0] | ——> 
Signal Processor 
System 


Generator 


Table 2. Description of each module's function in Fig.3 


Name Description 

clk_ade The sampling clock of ADC 

adc_srart The start signal of ADC 

Amp/[3:0] Choose the path with different magnifications 
adc_out[7:0] The output of ADC 

Txd Transmitting data of system serial port 

Rxd Receiving data of system serial port 

txd_t Transmitting data of testing module serial port 
rxd_t Receiving data of testing module serial port 


The testbench is used to test the analog signal’s magnification and the ADC’s 
sampling frequency controlled by the processor system. And this control is achieved 
by the controlling registers. 

In the test, the processor system can control the signal’s magnification. The 
corresponding controlling register goes through the 2-4 decoders, and then exports the 
control signals. The signals control the relay switch to select the amplifier. The four 
configurations of decoder are corresponding to the four different signal amplifiers. In 
Table 3.3, when the reg[1:0] = 00, the output amp[3:0] = 0001, which means to select 
the first signal amplification path. The truth table is shown in Table 3. 


Table 3. The truth table 


reg[1] Reg[O] | amp[3] amp[2] amp[1] amp(0] 
0 0 0 0 0 1 
0 1 0 0 1 0 
1 0 0 1 0 0 
1 1 1 0 0 0 


The control register in processor system can be configured directly by the 
assembler, and the program is as follow. In program, control register named 
REG_AMP is configured with 0X01(00000001). So reg[1:0] = 01, and when it is 
enabled, the corresponding output is amp[3:0] = 0010, namely 2, which means to 
choose the second signal amplification path. 


REG_AMP=0X01; 
REG_AMP_EN=0X01; 
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The controlling register for ADC controls to generate the clock and the start signal. 
When | bit in the control register namely reg[0] is 0, ADC's sampling clock is the 
same as the system clock; otherwise the clock offered to the ADC is only half of the 
system clock. The control register can be configured directly by the assembler too, 
and the program is as follow. In program, control register named REG_ADC is 
configured with 0X01(00000001). So reg[O]=01, and when it is enabled, the 
corresponding output clk_adc is only half of the system clock. 


REG_ADC=0X01 
REG_ADC_EN=0X01; 


Of course, the control registers can also be configured through the serial port. 
About serial communication, it requires the same baud rate between system and the 
uart test module. TMOD = 0x20 means both using timer Tl which works with mode 
2; THI = Oxe6, TL1 = Oxe6, setting the same initial value of timer T1; TRI = 1 means 
to start timing. When transmitting, SCON = 0x40 sets the serial port with mode | to 
send data. And when receiving, SCON = 0x50 sets the serial port with mode | to 
allow reception. So that it can achieve the serial communication. 


4 Test Results 


The simulation based on testbench, is to simulate the function of the mixed-signal 
SoC system using assembler through Modelsim. The acquired data of the system is 
stored in processor system. The result is shown in Fig.4. 


b1_tb/oc8051_sram/data_in 
f1_tb/oc8051_xram/wr 
p1_tb/oc8051_xram/stb 
h1_tb/oc8051_xtam/ack 


Fig. 4. Test result (1) 


According to the design requirements of the address space, the controlling registers 
are written directly by instructions. The configured result is exported by processor’s 
PO port, and the result is shown in Fig.5. The register is configured with fcg = 11, acg 
= 21, and when fcg_en and acg_en are enabled with 1, feg_o = 11, acg_o = 21, and 
export through PO port. 


Fig. 5. Test result (2) 
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It can also be configured through the serial port, as shown in Fig.6. SCON = 0x50 
means it allows to receive data. When sbuf=01, fcg_o=01. It makes amp [3:0] = 0010, 
namely 2, which means the system chooses the second amplifier to amplify the signal. 


-No Data- | ef 
-No Data- 

-No Data- 
-No Data- 

-No Data- fi: 
-No Data- 

-No Data- 


-No Data- 

-No Data- 

-No Data- 
_sfri/tt] |-No Data- 
_sfri#th1 |-No Data- firs 

-No Data- 40 

-No Data- 

-No Data- fies 
_sfl/ted | -No Data- 
_sfi/txd = |-No Data- 

-No Data- 

-No Data- 

-No Data- 

-No Data- 


Fig. 6. Test result (3) 


Through directly configuring the register to control the ADC’s sampling 
frequency, it is the same as the system clock or only half of the system clock. The 
result is shown in Fig.7. When acg_o = 00, clk_adc is the same as the system clock 
‘clk’; when acg_o = 01, clk_adc is only half the system clock ‘clk’. 


LT TL 
LALLA A oo 


Fig. 7. Test result (4) 


The test result of serial communication is shown in Fig.8, and when the baud rate is 
set the same, SBUF register of the receiving module can receive the data of 
transmitting module. SCON = 0x40 means it allows to transmit data, and SCON = 
0x50 means it allows to receive data. The data is 01,02...... , and exported by PO port. 
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|_sfrt/ted [Stl 


Fig. 8. Serial communication 


Through the above test results, the testbench design of the mixed-signal SoC 
system has completed the acquisition of analog signals, and has realized the processor 
system’s control of amplifier and ADC. 


5 Conclusion 


This paper provides a simulation testbench for a mixed-signal SoC system, which is 
to verify the system’s function by the software and hardware co-verification method. 
The results validates that the mixed-signal SoC system has realized its functional 
requirements, such as the processor system’s control of amplifier’s magnification and 
ADC’s sampling frequency. The establishment of the mixed signal SoC testbench, not 
only reduces the risk of SoC design and the cost, but also greatly reduces the 
development cycle of SoC chip. With the development of verification methodology 
among the design of SoC and other chips in the future, more scientific and advanced 
verification techniques will appear [9]. 
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Abstract. Based on Altera EP2C70, a block-based method is proposed to op- 
timize the Mixed-SoC design in this paper. A rapid verification platform for 
SoC prototyping system is studied and implemented. The Hardware/Software 
parallel design and co-verification are realized to greatly reduce the chip time to 
market. The verification platform establishes an integrated data stream from the 
PC to the targeted system, and solves the problems of the accurate, predictable 
and real-time sending, transmission, acquisition of the SoC verification system. 


Keywords: Mixed-SoC, FPGA, Block-based, Co-Verification, Data Acqusition 
System. 


1 Introduction 


With the development of large scale integrated circuits, integrated circuits has entered 
SoC (System-on-Chip) era. The difficulty of verification is increasing rapidly due to 
the constant expansion of SoC. Currently, the success rate of tape-out for the first 
time is only about 35%, which is largely due to the verification. SoC verification 
consumes 60 to 80% '"! of the whole design time, which has great negative impact on 
the return on investment and time to market. 

With the ever-increasing complexity of SoC and the increasingly urgent time to 
market, more attentions and concern are paid to the block-based design SoC prototype 
verification in the SoC design and verification. Rapid system prototyping, co- 
verification of hardware prototype and software prototype, has become a common 
verification method in the early stage of SoC design process. This powerful 
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FPGA-based hardware verification platform can quickly achieve Hardware modules 
in SoC design, which greatly reduce the chip time to market. 

In this paper, the data acquisition system is designed hierarchically using module- 
based design method of Altera QuartusII. Each module is optimized efficiently only 
once. This method significantly reduces design iteration time, reserve performance 
characteristics unprecedented and greatly improves work efficiency. On this basis, 
study the design of the hardware verification platform based on ALTERA EP2C70 
FPGA and the implementation of 8051 SoC rapid prototyping system, and then 
complete the Parallel design and co- verification of hardware and software. 


2 Mixed-SoC Design Optimization 


The main features of Mixed-SoC are IP-reuse and HW/SW co-design. General design 
flow is shown in Fig. 2.1[5]. This paper presents a block-based HW/SW co-design 
method to optimize the Mixed-SoC design flow. This method significantly shortens 
the total development time and realizes highly efficient team design. Designers can 
complete the high-density FPGA design iterations 4 to 5 times per day with this me- 
thod while they can finish the high-density FPGA design iterations | to 2 times per 
day with the traditional design method [3]. 

First, divide the entire design into software components and hardware components 
according to the design specifications. Then, divide the function parts of the software 
and hardware into blocks. Every block is designed, implemented and imported into 
the top engineering individually. Some of the functional blocks commonly used in the 
system are modular designed, debugged, packed and prepare for call. The general 
principles of module division are as fellows. The contact inside the module should be 
close. The function of each block is independent. The connection between the blocks 
should be as simple as possible. 


Syslem requirements 


systoms definition 
syslein belt ior Deseripiion Syste ard iiweiure deseriplion 
alse Frout-enel 
t Ga signal > al 
eH ; input signal a is ADC 

System functional + eriiivaion a adjustment 

v 
| Saliva ured lemslware division -—_— 
| y | 
Hardware Description Software Desenption S 
80C51 IP core 
Veal 
yeclor 


a a 


Syslum verilication 


Tarde verifieation Saltware verification [ 


: 


+ Interface Serial 
circurt Interface 


TTurdyeate Teka design 


Fig. 1. SoC HW/SW co-design process Fig. 2. Modular data acquisition system 
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In this paper, the data acquisition system is a Mixed-SoC system. Its modular 
structure is shown in Fig. 2. The front-end signal adjustment circuit and ADC conver- 
sion are realized with analog circuit. The interface circuit and serial interface are 
realized with the digital circuit. The 80C51 IP core as the core control unit control 
the analog input, signal adjustment, A/D conversion, digital data transfer and storage, 
serial port communication.The design and implementation of 80C51 soft core are 
completed in the software design phase. The modular structure of 80C51 is shown in 
Fig. 3[1]. 


External Clock. External Event Counter- 
RON - 
ALU Oscillator ys 7356 Timer x 2 
& Clock - a (16 bit) 
Control d [ [ 
Unit. : ] 
RAM Internal Interrupt | | 
SFR + . MV f 
Control ey Expansion Serial. P Nea J 
Unit Interrupt Bus Interface. sty . 
SOCST Control controller: — : 
73 
ae HT ft 4 
- Extemal Interrupt» Control Data/Address» RXD- TXD.- 


Fig. 3. Modular 80C51 IP soft core 


3 SoC Hardware Verification Platform Design 


The synthesis result of 80C51 SoC shows that 3121 logic elements are used. We 
choose EP2C70 considering the expansibility and cost-effective of the SoC 
verification platform, which is shown in Fig. 4. 


Serial ALTERA Adjustment 
commun}—) Cyclonell circuit 
ication EP2C70 I 


i ! ADC 


Power | | Reset} | Clock 


Fig. 4. SoC hardware verification platform 


3.1 Hardware Module 


The hardware circuit of the SoC verification platform includes the power circuit, 
reset circuit, clock circuit, a serial interface circuit, the front-end signal adjustment 
circuit and the ADC conversion. 
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3.1.1 SoC Verification Circuit Requires Multiple Powers 

The main power is 5v, 3.3v and +1.8V, and the 3.3v and +1.8V are conversed from 5v 
by the integrated voltage regulator LM1117. The current load capacity of the 5V 
power is no less than 5A, and the current load capacity of the +1.8V power is no less 
than 800mA. 


3.1.2 System Clock 

The SOMHZ system clock, generated by the active crystal, is linked to CLK1 of 
EP2C70.24MHZ block, generated by the internal PLL, is supplied to the 80C51 
soft-core. 


3.1.3, Reset Circuit 

System reset signal becomes effective automatically after remains high more than two 
machine cycles. This paper uses RC reset circuit with a gate to improve the stability 
and performance of the reset circuit. The reset circuit is showed in Fig. 5[7]. 


3.1.4 Serial Communication 

Level conversion between RS232 and TTL is required to realize the serial 
communication between the system chip and PC. The conversion circuit is shown in 
Fig. 6. 
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Fig. 5. Reset circuit Fig. 6. Serial communication 


4 The Transplant From SoC to FPGA 


SoC rapid Prototype system Verification is different from other verification method. 
Any of the other verification methods is part of SoC verification process, while the 
SoC rapid Prototype system Verification is an entire process. SoC is based on 
standard cell library, while FPGA is based on themacro-cell block provided by the 
manufacturers. 

Since SoC and FPGA are different in physical structure and performance, the RTL 
code should be modified first [2]. Then the mapping tools optimize the RTL code 
based on constraints and map the basic unit of the selected FPGA device to the net 
list. If the timing satisfies the constraints, the download can be realized using 
configuration file [6] If not, we can confirm the critical path according to the timing 
report to optimize the timing. 
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Since the synthesized 80C51 IP soft core does not include ROM and RAM, we can 
use the ROM module provided by QuartusII 7.0 to realize the function. The HEX file 
for test is embedded by “Mega Wizard Plug-In Manager” provided by QuartusII7.0. 

The specific verification process is as follows: 


1) Create a new project, add the SoC project files including 80C51 IP soft core and 
then synthesize the RTL code calling the ROM, RAM and PLL provided by Quar- 
tus IT7.0. ROM modules need to be modified according to actual situation. 

The synthesis maps of RTL are shown in Fig.7 and 8. The resource consumption 
reports after the synthesis based on FPGA is shown in Fig. 9. 

2) Write the test file into ROM, and simulate with ModelSim SE6.2b. 

3) Complete the pin configuration referencing FPGA pin reference manual, and com- 
piled a downloadable executable file (Top.sof). The resource consumption reports 
after the download is shown in Fig. 10. 


Fig. 8. Synthesis map of 80C51 IP core 


Analysis & Synthesis Status Suecessful - Sun May 16 15:32:18 2010 
Quartus IT Version 7.2 Build 151 09/26/2007 SJ Full Version 
Revision Name top 
Top-Level Entity Hane top 
Family Cyclone II 
Total logic elements 3,121 
Total combinational functions 3,121 
Dedicated logic registers 1, 798 
Total registers 1798 
Total pins 102 
Total virtual pins 60 
Total memory bits 553, 728 
Embedded Multiplier S-bit elements 1 
Total PLLs 1 


Fig. 9. Resource consumption reports after the synthesis 
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Flow Status Successful - Sun May 16 15:35:36 2010 
Quartus II Version 7.2 Build 151 09/26/2007 SJ Full Version 
Revision Name top 
Top-level Entity Name top 
Family Cyclone II 
Device EP2CTOF896C6 
Timing Models Final 
Met timing requirements No 
Total logic elements 3,823 / 68,416 (6%) 
Total combinational functions 3,126 / 68,416 (5%) 
Dedicated logic registers 1,798 / 68,416 (3%) 
Total registers 1798 
Total pins 102 / 622 (16%) 
Total virtual pins 60 
Total memory bits 553,728 / 1,152,000 ( 48 % ) 
Embedded Multiplier Q-bit elements 1 / 300 (<1%) 
Total PLLs 1/4 (25%) 


Fig. 10. Resource consumption reports after the routing 


5 Co-verification of the SoC Prototype Verification Platform 


The next task is to carry on the Co-verification of the SoC prototype verification plat- 
form. HW/SW co-verification environment is shown in Fig. 11. 
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Fig. 11. HW/SW co-verification environment 


Verification and debugging process is as follows: 
1) Control the verification platform under the control instructions issued by PC 
Applications. 80C51 control the periphery by configuring registers and output processed 
signal in parallel and serial way. The input analog signal is shown in fig. 12. 


Fig. 12. Input analog signal 
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2) Observe the run results 

a) Observe the feedback information of the hardware verification platform with serial 
debugging assistant terminal. The processed input analog is transferred to PC, which is 
shown in fig. 13. The magnification is set two. The result is consistent with expected. It 
shows that the SoC logic function and serial communication is right. 
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Fig. 13. The serial output data 
b) Observe the data from the FPGA internal parallel port by the embedded Signal Ta- 


pl provided by the QuartusII 7.0. The fig. 14 shows that the parallel data is consistent 
with the expected and serial data. 


lo 200083 2017109 40 ¢ i 


Fig. 15. The signal after DA conversation 
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c) The parallel digital data is transferred into analog signal by DA conversation, 
which is shown in fig 15. It shows that the signal is amplified by two as expected. 


6 Conclusion 


A block-based HW/SW co-design optimization method is proposed in this paper 
based on Altera EP2C70 FPGA prototype system to optimize the Mixed-SoC design 
flow. Then, a real-time hardware verification platform is developed to monitor and 
analysis the behavior and realize the close and flexible coupling of hardware and 
software. Finally, an independent verification platform is constituted to carry on the 
field monitoring. Code generated by the HW/SW Co-design in real-time is run on the 
platform. Control of the entire verification platform is realized and an optimize 
verification result is achieved. 
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Abstract. A video decoding coprocessor used as H.264 main profile is pre- 
sented. In the coprocessor, some decoder accelerating modules are involved, 
such as inter predictor, intra predictor, IDCT and weighted prediction. When the 
frequency of the system clock is 133MHz, the average time for this coprocessor 
to decode 720x480’s P frame is 9.2ms. After being synthesized by Design Com- 
piler, 90k gates are totally needed, including 1Kbyte RAM. This coprocessor can 
be easily integrated in ARM-based SoC and some other processor- based SoC 
after modification. 


Keywords: SoC, H.264/AVC, coprocessor. 


1 Introduction 


H.264/AVC is the newest, state-of-the-art, video compression standard [1]. Compared 
with other standards, H.264 has higher compression rate because of many new adding 
tools, such as flexible inter prediction, CABAC, integer transform and so on. Also 
much more resources are needed for H.264 decoding, and it will take much more time 
for general embedded CPU to decode large size H.264 frame. Therefore, in embedded 
systems or SoCs, decoding coprocessor is needed to accelerate the decoding process 
in order to meet the requirements of video communications, HDTV and so on. 

This paper describes a coprocessor used as H.264 main profile decoder. In the co- 
processor many decode accelerate modules are include, such as inter predictor, intra 
predictor, IDCT and weighted prediction. AHB slave interface is added in the copro- 
cessor to configure the register files used to control calculation progress and transfer 
decoder parameters. DMA interfaces are used to fetch reference frames and transfer 
the final results to external ram, and also to fetch the calculation parameters when the 
coprocessor is worked on parameter buffer mode. 

In the simulation at 133MHz clock, the average time for this coprocessor to decode 
720x480’s P frame is 9.2ms. After synthesis used Design Compiler, 90k gates are 
needed, include 1Kbyte ram. So this coprocessor can be easily integrated in ARM 
based SoC. The AMBA slave interface can be modified to other bus interface so the 
coprocessor can be used in other bus system. 
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2 System Overview 


2.1 Hardware/Software Partition 


There are two major categories of hardware-software partitioning methodology, 
ASIPs (Application Specific Integrated Processors) and processor-coprocessor sys- 
tems [2]. ASIP will modify the main processor core instructions to achieve applica- 
tion-specific function, so the compliers, libraries, operating system functions, and 
simulation and debugging environment must also be modified. But in processor- 
coprocessor mode, application specific coprocessor is added and is controlled through 
register files configuration by the main processor. So, little modification is needed to 
software development environment. 

Most of the SoC designs are based on ARM or other fashion processors. To modify 
these processors’ instructions will cost much for software development and other 
application work. And because this design is mainly used in AMBA bus system, so 
we design this H.264 decoder as a coprocessor which has an AHB slave interface to 
the main processor for decode control and parameters transfer. 

In the coprocessor, main calculation modules, which cost most calculation time 
and less dependent on the software parsing process of NAL, including inter predictor, 
intra predictor, IDCT, weighted predictor, are designed. The calculation is based on 
macro block, which includes a 16x16 Luma block and two 8x8 Chroma blocks. Soft- 
ware configures the control and calculation parameter of one macro block, start the 
decoder, and wait for the end of decoding process, then configure another macro 
block parameters. 


2.2 Decoder Parameter Configure 


Register files are used to store macro block parameters. There are two methods to 
configure the register files. One is configure through AHB slave bus at the beginning 
of every macro block decoding. The other is software write parameters of many ma- 
cro blocks to memory, and coprocessor fetches the parameters through DMA to regis- 
ter files. 

The first method does not need much memory, but software must wait coprocessor 
decoding end. Relatively the second method needs much memory, but software 
needn’t wait for coprocessor, so the software can work more independently. 


2.3 System Architecture 


The final coprocessor architecture is showed as Fig.1. 

AHB Slave and DMA interfaces are used to configure register files in control part. 
IDCT, Intra and inter predictor and other calculation modules are included in calcula- 
tion part. There are three DMA interfaces in calculation part: one is used for fetching 
IDCT source residual data, the other is for fetching reference frame for inter predict, 
and the third is for the final decoded macro block output. 
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Fig. 1. H.264 coprocessor architectrue 


3 Inter Predictor 


The inter prediction in H.264 has been made more flexible. This increased flexibility 
is one of the important changes which improve the level of compression achievable 
with H.264 [3]. 

In inter prediction; a number of different block sizes can be used: 4x4, 4x8, 8x4, 8x8, 
8x16, 16x8, and 16x16. The reference frame number can be up to 16, and each 8x8 block 
can have respective reference frame. So one B frame macro block, which include four 
8x8 Luma blocks can have 8 reference frames at most. The interpolation position can be 
half pixel or quarter pixel. First a 6-tap filter for obtaining sample values at half pixel and 
afterwards a 2-tap filter for calculating sample values at quarter pixel positions [4]. 
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Fig. 2. Inter predictor architectrue 
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The architecture of inter predictor is shown as Fig.2. Access Control unit controls 
access to DMA for reference macro block, block control unit is used for different 
block partition control, and computation control unit take charge of row or column 
interpolation data input to interpolation unit. Interpolation unit has 6-tap and 2-tap 
filters and 4 transpose rams to store the temporary data. 


4 Intra Predictor 


Intra prediction uses row or column of macro block to predict the decoding macro 
block. A number of intra prediction modes are provide in H.264: nine modes for 4x4 
Luma block, and four modes for 16x16 Luma block and Chroma block [5]. 

Because intra prediction needs the neighbor row or column of the neighbor block 
in the same frame, these neighbor data must be write as back to register files as calcu- 
lation parameters before computation begin. 
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row && column 


Fig. 3. Intra predictor architecture 


5 IDCT and Result Compensation 


IDCT in H.264 is simplified compare to MPEG4 and other standard. 4x4 integer trans- 
form is used and only shift and add operation is needed [6]. The 2D IDCT is imple- 
mented as two 1D IDCT operation. IDCT module architecture is shown as Fig.4. 


DMA 
ah 1D IDCT > 
Transpose 
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Fig. 4. IDCT architectrue 


Result compensation adds the IDCT results and intra or inter prediction results, 
gains the final decoded results. Weighted prediction function is added in this part. 
When weighted prediction enable, weighted coefficients configured in Register Files 
are fetched and used for weighted compensation. Fig.5 shows the Result compensa- 
tion module architecture. 
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Fig. 5. Result compensation architectrue 


6 Conclusion 


A verification platform based on System-Verilog is developed, including ARM VMM 
module, external DDR SRAM, internal data ram. The test clock is set to 133MHz, 
and the average decoding time for different size video is noted as the follow Table 1. 


Finally in TSMC 90nm, synthesis shows the critical path is 6ns, and 90k gate is 
needed. 


Table 1. H.264 Coprocessor simulaion proformance 


Image Size Decoding Time 
176x144 0.9ms 

Forman 320x240 2.8ms 
352x288 4.2ms 
720x480 9.2ms 
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Abstract. Compared with the fingerprint, finger vein is blood vessel network 
under finger skin, which is unique for each individual and hard to be forged. In 
this paper, an authentication system based on Nios II soft-core CUP is de- 
signed, which uses finger vein features as identity information. To overcome 
the problem of high time-consuming of software algorithms brought by the use 
of Nios II , a method of hardware acceleration is introduced. Through time anal- 
ysis of various parts of the image processing algorithm, high time-consuming 
algorithms are hardware accelerated and the requirement of real-time system 
is met. 


Keywords: finger vein, Nios II , FPGA, image processing, hardware 
acceleration. 


1 Introduction 


How to accurately identify the individual's identity information, and protect their 
information security, is one of the problems need to be solved in today's information 
society. Finger vein recognition technology is an emerging biometric identification 
technology. Compared with the fingerprint, finger vein is a blood vessel network 
under finger skin, which is unique for each individual and hard to be forged. Accord- 
ing to statistics, only 8 per 1,000 people have similar hand vein distribution [1]. Fin- 
ger vein image capture is easy, and takes up small storage space. So, as an important 
feature on authentication system, the biometric finger vein network also has the stabil- 
ity, high accuracy, low cost, and non-invasive characteristics. 

The system is based on Nios II soft-core processors, which realize the information 
of finger vein characteristic processing systems. Compared with the PC implementa- 
tion, the system is low cost, low power consumption, high flexibility, reconfigurabili- 
ty and easy to realize. The system in this paper uses human finger vein as the identity 
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information. First, capture the user's finger vein of information in real-time. Then do 
the finger vein image processing. Finally compare the result of image processing and 
original personal information in the database to make the information authentication. 
So, customers need to get their finger vein information collected and stored in the 
Finger Vein Information Database which is established firstly. As finger vein is less 
influenced by the growth of human body, it can be ensured that the database will be 
available in no less than 5 years. 

Finger vein information processing system is a real-time image acquisition and 
processing system, which need to run and finish within the given time. Through time 
analysis of various parts of the image processing algorithm, high time-consuming 
algorithms are hardware accelerated and the requirement of real-time system is met. 

The rest of this paper is arranged as follows: Section 2 gives a brief introduction of 
the hardware architecture of the system and functional description of each module. 
Section 3 describes the image processing software algorithms, and analyzes the time- 
consuming and reasons of each parts. Section 4 introduces the hardware acceleration 
to the main time-consuming algorithm. Some conclusions of this paper are given in 
section 5. 


2 System Hardware Architecture 


Using Nios II soft-core as the platform of image processing and success in the FPGA 
transplant, the system includes two parts: Hardware image acquisition and software 
image data processing. The hardware system is divided into outside image acquisition 
devices and FPGA embedded platform. The acquisition device connected to the sys- 
tem through the analog video port on the development platform. Nios soft-core is 
used for running the feature extraction algorithms and the authentication process. By 
designing the appropriate custom peripherals to accelerate algorithms and ensure the 
system's real-time. Connecting to the database through UART, users’ characteristics 
and their original information in database were compared. Meanwhile, AES algorithm 
is added in the system to ensure the confidentiality of users’ characteristics informa- 
tion. The system architecture is shown in Figure 1. 


2.1 Finger Vein Image Acquisition Device 


With a wavelength of 700nm-1000nm infrared light irradiation fingers, most organiza- 
tions will be penetrated. As infrared light will be absorbed by hemoglobin in the finger 
vein, the transmission infrared light from the vein network portion is weak, and none 
vein network areas one is strong. With the help of image sensor, the finger vein net- 
work topology is formed. The acquisition system is shown in reference [3]. 


2.2. IP Design 


Beside Nios II, there are other IP cores designed in FPGA, including: Image acquisi- 
tion control and decoding module, image hardware-accelerated processing module, 
SDRAM interface control module, VGA display control module, FIFO, on-chip 
shared RAM and custom peripheral module, which are shown in Figure 1. There are 
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two independent data channels after video decoding. One is through the SDRAM 
controller and the VGA controller which directly output to VGA monitor, the other is 
through the image hardware-accelerated processing module and FIFO, then stored in 
the On-chip RAM which is shared by Nios II. 


(1). Image acquisition control and decoding module: This module is used for con- 
trolling video decoding chip ADV7180 via I2C bus for the analog signal capture. 

(2). Image hardware-accelerated processing module: Since only the gray scale is 
used in the image date processing, in order to streamline the storage space and speed 
up image processing, the module is used for capturing a frame image from the NTSC 
standard data flow and extracting the Y component. This module is controlled by 
Nios II, and provides the data address defined by the user. 

(3). FIFO and on-chip shared RAM: As the clock of the video capture is 23MHz 
and the on-chip shared RAM is 100MHz, so FIFO is used in the cross-clock domain 
part of system. The on-chip shared RAM is used for storage the image data shared by 
FIFO and Nios II. 

(4). SDRAM control module and VGA display control module: After decoded by 
video decoding chip, the image signal can go through this channel and be output to 
the VGA. Each frame of NTSC video data is storage in the SDRAM dynamically, and 
then converted into RGB signals by VGA display control module, finally, displayed 
in the VGA. 

(5). Custom peripheral module: as the clock of Nios II is 100MHz, the real-time 
requirement of the system can not be met by using software to run the image data 
processing. After analyzing time-consuming of each step of the image processing 
algorithm, high time-consuming algorithms are hardware accelerated to reduce their 
run-time. So the real-time requirement of system is achieved. 
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Fig. 1. Finger vein authentication system structure 
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3 Image Data Processing Algorithm 
Image data processing is run on the Nios II, including three parts, which are prepro- 


cessing, venous contour extraction and authentication. The data processing flow is 
shown in Figure 2. 


Preprocessing 


procter code --- 2-2 ----------------- Success 


ee er as oe ee a Failure 


Fig. 2. Data processing flow 


(b) (c) 


Fig. 3. The result of image preprocessing: (a) Original image. (b) Image after copping. (c) Image 
after histogram transforms. 


An Authentication System Using Finger Vein Features 365 


3.1 Image Preprocessing 


Due to the limited of the image quality, the image preprocessing is required, including 
image cropping, and histogram transform. Image cropping is used for deleting unne- 
cessary background information, which is realized by hardware. Original picture size 
is 480*330*8bits, and changed to 210*330 after cropping. Histogram transform can 
enhance the gray-scale image to strengthen its finger vein information. After crop- 
ping, the image size is 210*330. So, it can be estimated that the histogram transform 
algorithm cycles about 140 thousand times. Figure 3 shows the results of image 
preprocessing. 


3.2 Venous Contour Extraction 


Venous contour extraction includes edge detection, feature extraction, and dilation. 
The application of edge detection algorithm is Canny operator [6]. The area of finger 
vein can be defined by the closed region of finger contour. Reference [6] shows that 
advanced functions, such as trigonometric functions, power function and exponential 
function, domain the edge detection algorithm. After the area of finger vein is de- 
fined, vein topological features can be extracted by the feature extraction algorithm. 
The steps of the feature extraction algorithm are shown as follows: 


1) Set outside of the image to zero to eliminate the feature points generated from 
the image border. 

2) According to the region of fingers vein create a same size memory matrix Z, and 
initialize each value to zero. 

3) The general finger vein width in image is 26 pixels, so set the vein width 
w= 26. 

4) Set the count value c = 0. Successively select points within the given region. 
Compare the selected points to their four sides, of which the size is 26*26. If the val- 
ue of both the top and bottom sides or both left and right sides is less than the centre, 
then c =c + 1. That means each point should be compared to it’s around 104 times 

5) After comparison of each point, if c>8 and its value (gray scale) is less than 170, 
then set the corresponding value in matrix Z to 255. 

6) Skeletonize the extraction features. 


To reduce the storage space, the binarization of vein features is used, i.e. 1 and 0 
stand for whether or not this position is the region of finger vein. The statistics shows 
that the average closed region of the finger contour is 120*330, so there are about 2 
million cycles. 

Dilation is used for expanding the finger vein to eliminate the impact made by 
small deformation of finger. The main part of the algorithm is cycle, about 40 thou- 
sand times. Figure 4 shows the results of extraction algorithm in different fingers. 
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Fig. 4. Original, vein extraction and dilation images of two different fingers 


3.3 Authentication 


To further reduce vein information storage space, the vein feature is 3*3 compressed 
after binarization. I.e. vein feature matrix is divided into several 3*3 sub-matrixes, if 
the number of | in each sub-matrix is greater than 4, then the sub-matrix is replaced 
by 1, otherwise use 0 instead of. The new matrix storage space can be shortened to 1 / 
9, and, as a result, reduce the running time of authentication algorithm. Number of 
cycles of the algorithm is about 4 million times. 

There are three methods of comparison to achieve authentication. The one is frequen- 
cy comparison. Via DCT, the vein image is transformed into the frequency domain. An 
advantage of this approach is high recognition success rate, but the drawback is the high 
time-consuming as complexity of the algorithm. The second is recognition algorithm 
using phase only correlation [5]. When the vein information is rich, the algorithm accura- 
cy is high, but in the case of the information-poor veins, becomes very low, with strong 
instability. The third comparison method is point by point comparison algorithm, i.e. 
compare the vein features point by point in a specific region. Advantage of the algorithm 
is simple and easy to implement, the disadvantage is low recognition success rate due to 
its high constraints. In this system, as the use of Canny operator and dilation algorithm, 
the region of the finger vein can be defined automatically, and the small deformation 
would be offset, which will greatly reduce the comparison constraints. So the third me- 
thod is used in the system, and the recognition success rate can reach a high level. After 
the compression algorithm the vein information matrix size is 40*110, so the number of 
algorithm cycles is about 5 thousand times. 


4 Algorithm Timing Analysis and Hardware Acceleration 


Figure 3.1 shows that the image processing algorithm includes image cropping, histo- 
gram transform, edge detection using Canny operator, vein feature extraction, dilation, 
compression, and point by point comparison. Since the system is user-oriented opera- 
tion, the system running time is set to no more than 2 seconds to meet the individual's 
tolerance. 

Because the image cropping done by the hardware, under the 10O0MHz of operating 
frequency, the time-consuming is about 2ms, and can be negligible. The high time- 
consuming algorithms are: 1) Edge detection algorithm (Canny operator), 6.8s. 2) 
Feature extraction algorithm, 4.1s. Running time of each part of the algorithm detailed 
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in table 1 below. Thus, the total running-time in software can not meet the system's 
real-time requirements. The major time-consuming algorithms, edge detection algo- 
rithm and feature extraction algorithm, need to be hardware accelerated. 


Table 1. Software running time of each part of the algorithm 


Software running} Ajgorithm 


time:(CPU 100MHZ)\-haracteristics: Renae 
Histogram Transform 0.3s Cycle About 140 thousand 
Trigonometric function, 
Edge Detection 6.8s Advanced 
computing —|power function 
Feature Extraction 4.1s Cycle About 2 million 
Dilation 79ms Cycle About 40 thousand 
Compression 81ms Cycle About 40 thousand 
Comparison 1lms Cycle About 5 thousand 
Total Time: 11.4s 


Feature 
Operator Extraction 


SSRAM (shared) 


Fig. 5. Custom peripheral system structure 


The hardware structure of edge detection algorithm (Canny operator) is shown in 
reference [7]. The image date stored in SDRAM is transferred through Avalon and 
processed by hardware. The result of edge detection is stored in SSRAM. Finally, a 
“done” signal will be sent to feature extraction module when the processing is over. 
Hardware vein feature extraction module is running in following four steps: 


(1). Define the region of finger vein according to the edge data stored in the SSRAM. 

(2). Process the defined region of image stored in SDRAM1, i.e. compare the selected 
point to four sides of rectangular around, and store the comparison result to SSRAM. 

(3). If all points in the selected region are processed, read the finger vein result in 
SSRAM and store it to SDRAM] through Avalon bus. 

(4). Send the conversion completion signal to the CPU and stop operation. 
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The structure of custom peripheral system is shown in fig.5. After hardware- 
accelerated, the total running time is reduced to 0.6s, to meet the system real-time 
requirement. The details are shown in table 2. 


Table 2. Final running time after hardware-accelerated 


Running time : Realization : 

Image cropping 2ms Hardware 

Histogram Transform 300ms Software 
Edge Detection 1.8ms Hardware 
Feature Extraction 20ms Hardware 
Dilation 79ms Software 
Compression 81ms Software 
Comparison 11ms Software 
Total Time: <0.6s 


5 Conclusion 


The system uses FPGA-based IP design and Nios II soft core processor. Nios II is 
used for software scheduling and controlling. While, some high time-consuming func- 
tions, such as some transform algorithms and video data stream processing, are 
implemented in hardware via HDL. Though the analysis of various parts of image 
processing software algorithms, high time-consuming ones are hardware accelerated 
and called by Nios II to meet the time requirements for data processing. 
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Abstract. Particle Swarm Optimization of excitation system parameters identi- 
fication in the study, A new adaptive inertia weight particle swarm optimiza- 
tion of excitation parameters used in the identification. Test proved that the new 
algorithm more powerful search capabilities to overcome a swarm of elementa- 
ry particles easily fall into the shortcomings of local optimum. 


Keywords: Parameter Identification, PSO, Adaptive, Inertia weight, 
Excitation. 


1 Introduction 


System identification is through the observation of a system or a process the relation- 
ship between input and output, describe the system or process to determine the input 
and output relations describing the system or process to determine the dynamic cha- 
racteristics of the mathematical model. Measurement system in accordance with the 
awareness level of treatment, the system under test can be divided into the black box 
system, gray box systems, white-box system. Excitation system is a gray box system, 
in accordance with the physical mechanism of a mathematical model, and then find 
the parameters of system identification. In this paper, time domain identification me- 
thod, which first identified the measurement system to treat non-parametric characte- 
ristics, time domain response obtained, and then the dynamic fitting technique, to 
strike from the dynamic characteristic curve model parameters. 

Normal operation of power system operation or accident, synchronous generator 
excitation system plays an important role. It has a control voltage to control the distri- 
bution of reactive power to improve the stability of synchronous generators operating 
in parallel, improving the ability of power system stability[1,2]. Most power system 
simulation software comes with some of the standard model of excitation system for 
the use of simulation analysis, however, the original model of the actual system is 
often not the software in the standard model. Therefore, the parameter identification 
method must be the actual model of the original model simulation software into the 
standard model of the library. In other words, the simulation software to determine the 
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model parameters, so that the actual excitation system model has the same or similar 
characteristics, for calculations. 

The particle swarm optimization (PSO) algorithm was first introduced by Kennedy 
and Eberhart in 1995[3]. As simple and easy to implement and robust features, has 
now been applied to many fields. However, the basic PSO algorithm to balance the 
global search and local search on the deficiencies. PSO updates the particle degree of 
inertia weight in the formula of the system global and local search plays an important 
role. In this paper, an adaptive inertia weight strategy, according to the objective func- 
tion value of each particle, adaptive selection of inertia weight. The objective function 
value of current particle of all particles smaller than the average target value, to re- 
duce the flight speed of particles; the current objective function value is greater than 
the particles the average of all the particles, the particles to accelerate the speed, set it 
as the maximum. Adaptive inertia weight particle swarm algorithm can meet their 
needs for each particle, which greatly improved the diversity of the population of 
particles to improve the PSO's search capabilities. The basic PSO algorithm with the 
known excitation system parameter identification comparative study to prove the 
validity of the algorithm and advanced. 


2 Mechanism of Excitation System Parameters Identification 


Excitation system model and simulation software in the original model structure and 
parameters are known, the parameter identification task is to determine the parameters 
of the standard model, so that the original excitation system model and the standard 
model of the input and output to maintain a certain error range. Identification prin- 
ciple is shown in Figure | 


X Physical | y E_, Identification A 
system algorithm 


Simulation 


Fig. 1. Schematic diagram of parameter identification 


Parameter identification process is, in the same excitation signal r under the influ- 
ence of the physical model of excitation system produces output )(f), The standard 


model of excitation system produces an output signal y(t), both error is €, after the 
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standard identification algorithm continuously adjust and optimize the model parame- 
ters until the error is € until the minimum[5]. 


3 Standard PSO Algorithm[3] 


The basic idea of PSO: random in the solution space to initialize a random population, 
which contains a number of particles, these particles in the solution space seeking the 
position of representative of solution to the problem, particles in the solution space to 
obtain the best according to their current location and the entire population in the best 
position to determine the current flight path, step by step approach the optimal region. 


Let Z, = (Zs Zjo9°''» Zigo’' > Zip) as the first i particles (i = 1,2, ..., m) of the D- 


dimensional position vector, Let V; = AV; Vig gt 3 Vig 8 5) for the particle i of 
the flight speed, that is, particles moving distance; Let 
P; =(Pa> Pine?» Piao» Pip) to search for the particles so far the best location; 


p, = Pas os Pes Se Da) for the whole particle swarm's best position so 


far to search. In each iteration, according to the following formula particle velocity 
and position updates: 


k k k 
=Wig FCN (Dia — Zia V+ 62% (Pea — Zia ) (1) 


k+l 


Via 


ae as a +Vig (2) 
k is the number of iterations, rl and r2 [0,1] random number between. rl and r2 is 
called learning factors or accelerating factor, which makes particle self-summary and 
excellent individuals to groups the ability to learn to the best advantage to their own 
history and the history within the group close to the most advantages. The first is the 
inertia weight, ability to play an optimal balance of local and global optimal capacity; 
the second is the "cognitive" part, on behalf of the particles on their own learning; 
second is "social" part, on behalf of collaboration among the particles. 


4 Adaptive Inertia Weight PSO 


When a particle in a local optimum, it may always be the best time evaluation, due to 
the interference of local optimum, resulting in a premature local optimum particles. 
According to the objective function value of each particle, adaptive selection of iner- 
tia weight. The current objective function value is less than the particle, the average 
target value of all the particles, to reduce the flight speed of particles; the current 
objective function value is greater than the particle, the average target 
value of all the particles, the particles to increase the maximum flight speed. Equation 
(1) the selection of inertia weight: 
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(Wrsie 7 Wrnin ME; ~~ Bins ) E < E 
_ Wain E E 2 i —~ ““avg 3 
sd a avg *~“min (3) 
Wrnax > E; 2 Es 
One, W,, and W,,,, represent the W of the minimum and maximum values, 7. for 


the particle current objective function value, F,,;,, and a are the minimum and 


average of all target particles. Objective evaluation function is the mean square error: 
T 2 
€=20,-5) (4) 
i=l 
One, T as the number of sampling points, y; as the best target. 


5 Example 


The following is based on adaptive inertia weight particle swarm optimization of 
excitation system parameters identification of specific steps: 


Step1: In the Matlab software, excitation system were established in the original 
model and the standard model, the equation (4) as the parameter identification of the 
objective function; 

Step2: the standard model to determine the parameters to be identified; 

Step3: Termination conditions and given the maximum number of iterations; 

Step4: For each particle is initialized; 

Step5: Enter the excitation signal, the simulation calculation, based on the original 
model of excitation system, the standard model of particle output error cal- 
culated fitness. 

Step6:Particle swarm optimization using parameter identification 

Step7:When the objective function value less than or equal eps, the end 


loop(eps = 2.2204107'°) 


Model (Figure 2), the parameter value: kz = 5.0513,k,, = 100.718,k; = 2.8965 


a) a a 
k, 1+0.02s 1+0.6s 1+5s 


1 2 3 4 


Fig. 2. Excitation System Model 
1:PID;2: Regulator;3: Exciter;4: Generator 
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Figures 3 and 4 are SPSO trend curve fitting with WSPSO 


Amplitude 


0 
(7 


Fig. 4. Adaptive inertia weight PSO algorithm output curve fitting trend 


Red response curve for the actual model output; Blue Wave as the standard system 
output. Figure 3 and Figure 4, we can draw contrast, the basic optimization ability of 
PSO algorithm is significantly higher than adaptive inertia weight PSO algorithm 
optimization poor. Identification results shown in Table 1: 


err Algebra time kp kp ky 
200 


5.0513 100.718 


Real value 5.0513 100.718 
Limit 6 200 
Lower limit 1 1 1 


6 Conclusion 


According to the objective function, the dynamic adjustment of flight speed particle 
inertia weight PSO can be well balanced regulation of local search and global search 
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capability. Avoid premature local optimum particle swarm. Basic PSO with the exci- 
tation system parameters identification comparison proved adaptive inertia weight 
particle swarm algorithm, accuracy and advanced. 
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Abstract. The Temporal locality and spatial locality have been repeatedly 
studied and widely exploited in the computer storage hierarchy. They are 
crucial principles of cache and buffer. But current spatial locality researches 
have been confined to a small time scale such as several instruction cycles. The 
periodic page accesses pattern can be discovered by expanding the time scale to 
seconds. To reduce the rate of cache and buffer miss is the primary goal of most 
cache and buffer management algorithms. When the miss is inevitable, the 
practical solution is to reduce the miss cost. A buffer cache management 
algorithm (Periodicity and Miss Latency Algorithm, PLC) has been proposed 
based on the periodic page accesses pattern and page miss cost evaluation. By 
keeping the pages that cost a lot of instruction cycles to swap in or out in 
cache/buffer as long as possible, the PLC algorithm has been proved practical 
and efficient. 


Keywords: page scheduling, management algorithm, periodicity, latency cost. 


1 Introduction 


To balance the conflict of performance and cost, the hierarchical architecture has been 
introduced into the storage subsystem. This hierarchical architecture consists of three 
levels of cache in CPU, RAM, and External Storages. The caches and buffers between 
two levels make it possible to get the approximate performance of higher level at the 
approximate cost of lower level. 

Since the caches are built by the CPU-similar techniques, their prices are high and 
capacities are restricted. A lot of algorithms have been proposed to maximize the effi- 
ciencies of the cache and buffer. The most common strategies are as followings: 

Optimal (OPT): This is the theoretic best strategy to decide which exist data or in- 
struction in the cache or buffer will be replaced by the pre-fetched one. This strategy 
will swap out the data or instructions whose next use will occur in the farthest future. 
Since it need to know the information about future, it can only been deployed for 
algorithms comparisons. 
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Least Frequently Used (LFU): This algorithm swaps out the page accessed least 
frequently. The access times are recorded in an access counter. 

Random (RDM): It just randomly selects out the page to be replaced by new one. 

Least Recently Used (LRU): This algorithm takes the least recently used content as 
the farthest future used. It is an acceptable simulation of optimal algorithm. 

Fist In First Out (FIFO): It just simply moves out the head content of the cache or 
buffer queue. 

Clock (CLK): The CLK is a combination of LRU and FIFO in practice. Because the 
LRU needs several bits to record its time stamp, it is a great consumption in the capaci- 
ty restricted cache or buffer. The CLK algorithm uses a ring to organize space of the 
cache or buffer. A pointer is designed to clockwise check pages of the ring. The first 
page encountered with used-bit of value 0 will be replaced. The other pages’ used-bits 
will be switched from | to 0.There are a lot of CLK algorithm variations have been 
adopted by Unix/Linux operating systems to manage their caches and buffers. 

The temporal and spatial localities [1] are the fundamental principles of these 
cache and buffer management algorithms. This locality theory tells us that most of 
instruction requests and data requests occur very closely in time scale or/and spatial 
scale. If we pre-fetch the instructions and data that have been stored near to the in- 
struction and data being processed, the seek time and reading latency will be reduced 
while the real fetch is required. 


2 The Page Access Behaivior Analysis 


2.1 The Latency Cost of Page Miss 


To reduce the rate of cache and buffer miss is the primary goal of most cache and 
buffer management algorithms. When the miss is inevitable, the practical solution is 
to reduce the miss cost. 

A specific page that be requested by an active process exists in cache of CPU, 
working set in RAM, cache in Hard Disk’s controller circuit or 8 sectors of a disk’s 
plate. Hence the request will be satisfied by cache hit or swap-in with latency cost. 
Table | shows how the storage system works when a page access is requested. 


Table 1. Cache hit and page swap-in 


es Access Access : yp an 

CPU’s RAM HD’s Cache Cache Hit To Higher 

Cache Hierarchy 
Page in CPU’s Cache Hit - - 1 time 0 time 
Page in Working Set Miss Hit - 1 time 1 time 
Page in HD’s Cache Miss Miss Hit 1 time 2 times 
Page in Sectors Miss Miss Miss 0 3 times 


The access latency of the top 3 hierarchies such as CPU’s cache, working set in 
RAM and HD’s cache is determined by electronic pulses of semi-conductor. It takes 
several nanoseconds or microseconds. While the access latency of the sector access 
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on a disk plate depends on mechanical movement. The average track seek time laten- 
cy and rotary latency of a Seagate’s ST3750528AS are 8.5 milliseconds and 4.17 
milliseconds [2]. The latency cost takes 12.67ms which is thousands of times higher 
than the former. 


2.2 The Periodic Page Access Pattern 


Current spatial locality researches have been confined to a small time scale such as 
several instruction cycles. The periodic page accesses pattern can be discovered by 
expanding the time scale to seconds. 

DiskMon [3] is a system event tracing tool provided by Microsoft to monitor disk 
access activities in Windows environment. It runned in an active web page server to 
record any disk access event in our laboratory. A department-level business 
assessment system has been deployed in this server. The hardware and software 
inventory of this server are shown in Table 2: 


Table 2. Inventory of Server 


Components Specifications 
CPU Intel Core E6300 
RAM DD2-667 4GB 
Hard Disk Seagate ST80P15K scis 300GB 
Operating System Windows Server 2003 SP1 
Internet Information Services MS IIS 6.0 
Active Page Services ASP.net 2.0 
Database SQL2005 
Development language C#2005 


The DiskMon’s monitoring lasted 900 seconds. Table 3 illustrates the distribution 
of disk sector accesses. The percentage of single access to a specific sector is less than 
28%. 


Table 3. Distribution of Disk Accesses 


Range of Quantity of Quantity of CDF of 
Sector Accesses Sectors in the Range Accesses in the Range | Access Times 
time 1335 1335 27.3% 
2~5times 310 913 46.0% 
6~10times 88 683 60.0% 
11~15times 33 423 68.7% 
16~20times 15 265 74.1% 
21~30times 4 100 76.2% 
31~100times 6 305 82.4% 
More than 100 2 858 100% 


The layout of disk accesses is shown in Figure 1. The horizon axis means time, 
while the vertical axis stands for the sequence number of disk sector. 
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Fig. 1. Sector Accesses Layout 


The DiskMon monitors disk access activeties by the unit of sector. Universally, a 
sector covers 512 Bytes, and a pge covers 4 kilobytes both in Windows and Linux 
environment. Each page consists of 8 sectores. Consequently, 2 disk accesses to the 
same sector means repetitive visit to a same page. Even 2 disk accesses to adjacent 
sectors most probably means visit to a same page. It makes acceptable to assess the 
periodical page accesses by monitoring sector accesses. 

The periodic page access pattern can be defined from the figure and table above: 


1) Page accesses concentrate on in a small scope of the sectors. 
2) These partial sectors are accessed at intervals. 


3 The New Page Scheduling Schema 


As we have discussed in 2.1, the performance of storage subsystem can be improved 
by reducing the miss rate. Many cache management algorithms try to keep the pages 
that most probably be accessed in near future. The methodology of these algorithms is 
to maximize the possibility of cache hit [4][5]. 

On the other hand, it is obviously that the cost of cache miss is expensive, especial- 
ly when the miss happens in the HD’s cache. Accordingly, we propose a new cache 
management schema with the methodology of minimizing the cost of cache miss 
based on the periodic page access pattern. We name it a PLC (Periodicity and Latency 
Cost) scheme. 

Traditionally, the physical memory of a computer can be shared by several running 
applications. Each applications occupies an active working set consist of an accessed 
area (AA) and a prefetching area (PA). The PLC takes an additional area for each 
cache working set. This area is reserved (RA) for the pages which have a high latency 
cost of being swapped into cache. The structure of PLC is shown in figure 2. 

If there is a page with a swap-in latency cost higher than average cost (12.67ms in 
our case), and the following page request was satisfied after an above-average latency 
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again. This means the latter page request is an isolated and random access. This page 
will be moved into the reserved cache area where is not checked by LRU or CLK 
algorithm. Figure 2 shows the structure of PLC shema. 


AAI | PAL | RAL | AAZ | PAZ | RAZ] ow. | AAn | PAn | RAn 
% ) X ) X ; 
eve Ye —~——* 
Working Set of Working Set of Working, Set of 
Application 1 Application 2 Application n 


Fig. 2. Structure of PLC 


Suppose there are two applications running in ram. Application | is with a refer- 
ence string of A, B, C, D, and 1,2,3,4. For example, the A, B, C and D are neighbor- 
ing pages in an executable file located in a user directory, while the 1, 2, 3 and 4 are 
pages of dynamic library files in a system file fold. Application 2 is with a similar 
page request combination of a, B, y , 6 and 1, 2, 3, 4. Hence A, B, C, D anda, B, y , 6 
are sequential page accesses separately, and 1, 2, 3, 4 are random page accesses. 

The system sends out a reference string of A, B, C, D, 1, 2, 3, 4, A, B, C, D, a, B, 
y,6, 1, 2, 3, 4. Figure 3 shows how PLC algorithm works. 


Sep 1 2 3 4 5 6 7 8 9 Db H RP B 4 HK 6 17 B 9 2 


Reference A B C D 1 2 3 4 A B C Dap y 6 1 2 3 4 


String 

A||) A} | A} ] AT} AJ] AP] AP] A} | AP} AP] A} P ATP a})affayjal}asjalja 
B|) B| | B} | B/| Bj| BJ} BJ] B} | B} | B) | BJ | BJ} Bl) BI} BI) By) | BI] BI] B 
Cy) CL} C} | Cr} Cyy Ce} Cy] Cy} Cry Cyl Cy) Cllyiiyilydiyvilyilyity 
D}|; DJ} | D} | D}| Dj] DJ} DJ | D} | D} | DJ | D) | DJJ S}) off 6] ) 6] } slj sl ]s6 
i ll al 1 1 1 i ee ee le ee ei i le 
Aes a aaa eaeaea eee 
SHR RE BoBC PT PCB reg 
4) 1/4) )4/|4) |4/|4/]/}4)/41/ 4] |4/) 41] 14 

(Prefetch ) (Prefetch) 


Mey 1267 0 0 0 1267126712671267 0 0 0 0 1670 0 0 0 0 0 0 


ms 


Fig. 3. Example of PLC 


Step1~4: Because A, B, C and D are sequential pages, only the access of page A 
leads a cache miss. Then, the prefetch mechanism works. All four pages will be read 
into cache at one time. The time consumption is 12.67ms. 

Step 5~8: Each access for page 1, 2, 3 and 4 will result a cache miss. The cost of 
time is 50.68ms. All these four pages are to be reserved because of their high cost. 
The shadowed grids mean to be reserved. 

Step 9~12: Another access for page A, B, C and D. Because of cache hit, the cost 
of time is approximate 0. 
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Step 13~16: Visiting pagea, B, y and 6 will result cache swapped out some old pag- 
es to get space. The LRU/CLK algorithm will swap out the page 1,2,3 and 4,because 
these pages are less recently used than page A,B,C and D. Instead, the PLC will re- 
serve page 1,2,3 and 4 for their high latency cost. 

Step 17~20: Another access for page 1, 2, 3 and 4. Due to PLC algorithm, they can 
be visit in cache. The time consumption is about 0. On the contrary, the LRU/CLK 
has already swapped them out. Each access will result in a cache miss. These 4 misses 
will cost 50.68ms totally. 

Figure 2 shows PLC encounters 6 misses that cost 76.02, while LRU/CLK will 
takes 126.7ms. 


4 The Implementation of PLC Schema 


The PLC has been deployed in a Linux2.6.32 system. The cache and buffer page 
frame is managed by the function of refill_inactive_zone() in the source file of 
mm/vmscan.c. The function was modified by the following algorithm of this schema: 


/*Function will be invoked when a page frame is 


requested* / 


void* PLC(page_frame_request p){ 


cost=0; 
while(p.next!=null1) { 
if p is in PLC_stack{ 


p.use_bit = 1; 


return p.pBPF;}/*return the page in cache*/ 
else if (p.use_bit==0&&p.res_bit==0) { 


cost = Jiffiles; /*Global variable of Linux, 


records the clock tick of system*/ 
replacestack(p);/*read page from disk*/ 
cost = Jiffiles - cost;/*the Latency Cost*/ 
p.usebit = 1; 
if (cost>threshold) /*threshold equals average disk 
latency */ 
p.res_bit = 1;/*reserved page*/ 
else 


p.res_bit = 0; 


return p.pBPF; } 
p=p.next; } 
} 
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Primarily, the PLC schema was tested in desktop applications. The hardware of 
the test machine is with a CPU of Intel P4 3.2G, 2GB DDR2-667 RAM and a 320GB 
HDD of ST3320620AS. The diff is a Linux tool to compare text files, and iostat is a 
tool to monitoring the I/O stat of the Linux system. We use the iostat to count activi- 
ties and throughput of the system while running the command of diff. Table 4 shows 
the differences between original LRU/CLK and PLC schema. The actual quantity of 
I/O activities has been reduced because of higher latency cost operations has been 
avoided. 


Table 4. Test Results From iostat 


rrqm/s wrqm/s rsec/s avgqu-sz await 
PLC 96 23 95123 4.35 12.5ms 
LRU/CLK 102 25 96321 4.94 13.78ms 
Improvement 5.88% 8.00% 1.24% 11.94% 9.29% 


Then, a Linux file system benchmark tool-PostMark [6] developed by NetApp 
was adopted to test the random access performance. The PostMark initially set up a 
file pool, then it create, delete, open, close, open, read and write file(s) randomly. 

The configurations of the test are as followings: 


set size 1000 50000 #amount of files varies from 1000 to 5000 
set location /usr/cfs  #directory of file pool 

set transactions 5000 #file transaction amount 

set number 5 #max concurrent transactions 

run result.txt #file for test results 


The results of the test illustrated that the whole test lasted 9 seconds, 7455 files were 
created successively, 2553 read and 2447 append(write) transactions occurred. Figure 4 
shows PLC improves the read speed and write speed by 4.11% and 4.91% respectively. 


10 ~ReadSpeed (MB/s) Write Speed (MB/s) 


Fig. 4. Test Results from PostMark 


5 Conclusions 


Spatial locality manifests periodicity if we expand the time scale from several instruc- 
tion cycles to several seconds. Since pages will be accessed repeatedly, reproducibility 
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of latency cost need to be considered carefully in cache and buffer management algo- 
rithms design and implementation. The PLC schema intends to minimize the high 
latency cost operations of the storage systems. Our experiments illustrates that under 
the given circumstance, this schema is practical and effective. 
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Microcontroller 
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Abstract. Aiming at the status that commercial robot structure is complex, 
expensive and unable to popularize, this paper designes a simple home service 
robot system based on Infineon 16-bit single chip microcontroller. It introducs 
the mechanical structure, and mainly recommends hardware and software to 
elaborate the design and realization of the control system. The robot adopts 
double controllers, with two model cars as its feet, each of which communicates 
with each other through wireless communication module to walk simultaneously. 
It uses Camera to automatically identify target objects, and emploies ultrasonic 
sensor to measure the distance between robot and target objects. It adopts 
E-compass to fix position, uses phonetic module to realize human-computer 
interaction and grabs target objects with arms. Experimental results show that the 
robot realizes the expected functions, with its features of simple structure, low 
cost, flexibility of control and certain application value. 


Keywords: Infineon; Microcontroller; Home service robot; Electronic compass; 
Wireless communication. 


1 Introduction 


Along with the rapid development of intelligent robot technology, the application of 
intelligent robot is expanding constantly. It is not only applied in industrial and 
agricultural production, but also even used in home service industry. Home service 
robot, which merges the core technologies of robot vision, phonetics, intelligent 
human-machine interaction, network technology, sensor detection, control technology 
and system integration together, will replace human to finish all kinds of housework, 
including clean sanitation, items, handling, family entertainment, patient monitors, 
could greatly improve the quality of people's lives. Meanwhile, China is facing more 
and more serious problem of aging population, so robot will have great application 
prospects in many families with older or disable people. However, in China, the 
research of humanoid robot starts later, there is still a big disparity between our country 
and the robot powers such as Japan and Europe countries. Although current high-end 
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home service robots abroad have a great number of functions, their structures are most 
complex, and the price is high and are unable to popularize temporarily[1]. 

Aiming at the characteristics that home service robot should have, this paper 
designs a home service robot system with the advantages of simple and practical 
mechanical structure, low cost and flexibility of control. It uses two cost-effective 
Infineon site microcontroller as the core controllers and aims at system reliability and 
stability, real-time ability and optimization of overall performance. 


2 Summary of the Mechanical Structure and Function of Robot 


2.1 The Mechanical Structure of Robot 


The robot is consisted of five parts of head, arms, waist, legs and feet. It has 17 degrees of 
freedom. Head has two degrees of freedom, which is used to realize pitching and 
horizontal rotation. It is equipped with a camera, which is used to realize target 
recognition and allocation and to provide target information to the master 
controller;Ultrasonic sensorh is used to detect the distance between robot and target, and 
then report it back to the master controller to control the speed. Each arm has five degrees 
of freedom and can extend and bend to grab the target accurately. Waist, which is 
equipped with a E-compass to detect the pitch and azimuth of robot, has one degree of 
freedom to realize horizontal rotation of the upper body of robot. Knee has two degrees of 
freedom to realize crouch. Feet are two Freescale intelligent vehicles, each of which is 
consisted of a car body, a steering servo and a DC motor. Each of the feet is equipped 
with a microcontroller, which is used to realize the information collection and processing 
and control algorithm; Wireless communication module is used to exchange information 
between two controllers. The mechanical structure of robot is shown in Fig. 1. 
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Fig. 1. Robot mechanical structure 
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2.2 Summary of the Function of Robot 


The robot can realize human-computer interaction, target tracking, path recognition 
and object grabbing. The specific work process is that, turn on the power switch firstly, 
then robot will wait to receive a certain voice command. Once start command is 
received, the motor of waist will rotate and the camera of head will search the certain 
target. After got the certain target, the angle and location from E-compass will be 
obtained. Then the motor of Waist will rotate back according to the angle. After 
adjusting its body, the steering servo on the feet will act according to the angle from the 
camera and the E-compass and the DC motor will act to track the certain target. 
Meanwhile, the distance from ultrasonic sensor is got to control the speed of robot. 
When robot is close to the target, it will stop moving and the arm will act to grab the 
target. Then robot will return back to the starting point and put down the target. 


3 The Overall Design of Control System 


The control system is a hierarchy control system based on two controllers, one of which 
is master controller and the other is slave one. In this hierarchy control architecture, the 
controllers have a clear division of work and it exchanges information between the two 
controllers through wireless communication, which can relieve the master controller’s 
pressure generated by the frequent interruption of multi-interrupt sources and can 
enhance the robot’s response ability and stability of operation. 

The control system of home service robot adopts modular design. The master control 
system includes master-control module, perception module, display module, wireless 
communication module, power conversion module and driver module. The slave 
control system is consisted of distance measuring module, driver module, power 
module. The whole block diagram of the system is shown in Fig 2. 
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Fig. 2. Control system structure 
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4 The Hardware of Control System Design 


The hardware of control system also adopts modular design, and the detailed design 
modules are shown as follows: 


4.1 The CPU Control Module 


According to the fact that the control system should have good expansibility and rich 
resources, this system adopts the latest Infineon XE164FM - 72F80L 16-bit 
microcontroller which is produced by Germany Siemens Infineon Technologies. The 
microcontroller has advantages of small volume,high performance-price ratio, high 
integration, easy system extension and strong ability of data and can carry out digital 
signal processing. It contains 16 A/D converters with 10-bits precision, 16 channel 
universal capture/comparison, can produce 16 unit PWM waves, and possess strong 
ability to process the interrupts generated by 96 interrupt sources and 16 interrupt 
priorities[2]. 


4.2 Perception Module 


It mainly includes the visual system, the hardware circuit of ultrasonic sensor, phonetic 
recognition. The visual system uses CCD camera, whose pixel is 320 X 240. The CCD 
camera, whose output signal is PAL, outputs 50 fields of pictures per second(divided 
into strange and dipole field). In order to collect camera video signal effectively, it is 
needed to capture sync signal and vertical sync signal, otherwise microcontroller will 
not receive the video signal correctly. The video sync separator chip, LM1881,which is 
produced by America's national semiconductor company, is used to separate the video 
signal. Fig. 3 is the circuit principle diagram of signal separation of video. 


Fig. 3. LM1881 video signal circuit separation 


Phonetic recognition module mainly completes certain people’s voice recognition 
and speech synthesis output. The phonetic recognition chips, AP7003, is a new type, 
low cost, phonetic recognition, application-specific integrated circuit with microphone 
amplifier, an A/D converter, phonetic processor and the I/O controller. It can identify 
12 groups of words after pretreatment, each of which spends 1.5 seconds, and is highly 
programmable and easy to use. Voice signal is input to AP7003 by microphones and 
then to the robot. 
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4.3 Power Conversion Module 


The system uses four 7.2 v, 2000mAh nimh batteries. Because each part requires 
different voltage, different power switching circuit are designed to provide the voltage 
of +12v, +5v, 6v and +3.3v separately. 


4.4 Motor Driver Module 


It uses PWM to adjust the motor speed. The motor drive chip is MC33886. It is a 
special motor drive chip manufactured by Motorola company, and the maximum output 
current is 5A. Two parallel chips could improve the drive ability, and the biggest drive 
current can reach 10A. The circuit diagram is shown in Fig 4. 


Fig. 4. Motor driving circuit 


4.5 Wireless Communication Module 


NRF24L01 which is a single chip wireless transceiver integrated chip is adopted. It 
employs the GFSK modulation with strong anti-jamming capability, with the 
advantage of stable and reliable working frequency, and its peripheral devices are less, 
power consumption is low and it is suitable for handheld and portable product design. It 
is mainly used in robot communication between master and slave controller. In order to 
make the two feet coordinated and synchronized, the master controller uses wireless 
module to exchange information, which includes the visual information measured by 
camera and distance information measured by ultrasonic sensor, to control the direction 
and speed of the two feet. 


4.6 Electronic Compass 


It is put in the waist and mainly used for measuring the heading angle and pitch to 
navigate and keep balance. It is mainly consist of control unit (MSP430), three axis 
acceleration sensors (SCA3000), a temperature sensor (TMP100) and magnetic sensors 
(PNI11096)[3]. The temperature sensor is responsible to compensate for measurements 
of acceleration sensor and magnetic sensor. PNI is a drive chip, which converts the 
weak current measured by PNI11096 into the digital quantity that can be identified by 
MSP430. 
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5 The Software Design of Control System 


The whole structure of control system is shown in Fig. 5. The software of control 
system includes two main parts: software of master control system and slave control 
system. 
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Fig. 5. Overall structure of control system software 


The software of master control system is consist of main program, subprograms of 
camera information acquisition, ultrasonic distance measuring, wireless communication, 
grabbing object of arm, E-compass information acquisition and PWM generation. The 
main program includes system initialization, control flow and realization of typical 
control algorithm. The subprogram of camera information acquisition accomplishes the 
acquisition of camera information. It gets the gray level of every frame picture through 
capturing the link interruption and field interruption and then gets the object’s location in 
the camera’s world coordinate system. The software flow pattern is shown in Fig 6. 

The subprogram of ultrasonic distance measuring completes the distance measuring 
between robot and the target. The controller firstly generates a trigger pulse of 10us to 
trigger the ultrasonic module to transmit a series of ultrasonic. Meanwhile the timer and 
the capture interruption is opened. The ultrasonic will return back when it comes across 
the obstacle and will be received by the receiver of ultrasonic module[4][5]. Then the 
receiver will generate a falling edge pulse. After the controller captures this falling 
edge, it will close timer andfis got. The distance will be calculated according to the 
formula: 


S =340*1/2 (1) 
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The software flow pattern is shown in Fig 7. 
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Fig. 6. Camera acquisition flowchart Fig. 7. Ultrasonic ranging flowchart 


The subprogram of wireless communication completes the data exchange between 
master controller and slave controller. It includes the initialization of NRF24L01 chip, 
configuration of address and data length and transceiver sub-function. The subprogram 
of serial communication realizes the data delivering between the controller and 
E-compass. The subprogram of PWM generation generates PWM of different period 
by using timer to drive the servos of arm and motors of head and feet. 

The subprogram of camera information acquisition, subprogram of ultrasonic 
distance measuring and the feet of robot realize target tracking together. Target tracking 
means that robot moves forward or backward and turn left or right by following a 
certain object. In the view of camera, a plane coordinate system is built. In the 
coordinate system, the original point is on the left bottom and both X-axis and Y-axis 
are divide into 100. The target is black and the controller store the gray level of object 
into a two-dimensional array. The data in the two-dimensional array is detected to get 
the coordinate values of the edge of target. The coordinate values of upper and lower 
edge are added together and then is divided by 2. The coordinate values of left and right 
edge are also added together and then is divided by 2. And the coordinate values 
(P,P,) of the center of target in the camera world coordinate system are got. To 
compare (P,P,) with the center (50,50) of camera world coordinate system, if P, is 
bigger than 50, the camera will deflect downward; otherwise, the camera will deflect 
upward. This can ensure that the camera can always capture the target. If P is bigger 
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than 50, that the target is on the left of robot and robot will turn left; otherwise, it will 
turn right. This ensure that robot can always track the target[6]. 

The subprogram of grabbing object of arm controls the arm to grab the target object 
according to the position got by camera and ultrasonic module. 

The software of slave control system includes main program, wireless communication 
subprogram, voice recognition subprogram and PWM generation subprogram. The master 
control system and the slave control system coordinate together to realize the basic 
function of robot. 

The whole software flow pattern of master-slave control system is shown in Fig 8. 
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Fig. 8. Overall software flowchart 
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6 Conclusions 


By combining with the design function of robot, this paper establishes a hardware 
platform that uses two micro-controllers as the core. The basic function of the home 
service robot is realized by the cooperation of every module. This robot has the 
advantage of low cost, modularization, easy expansibility, easy portability and high 
reliability. More functions can be realized through external sensors and flexible 
software programming to really satisfy the design requirements of home service robot. 
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Abstract. ADA is a high-performance language for scientific and technical 
computing. Different program packages can be used in corresponding areas, 
such as math, control theory, economics. We describe the design of the Leontief 
input-output model toolbox which can simulate the trajectories of input and 
output of economic systems and automatically control the run of economic sys- 
tems. By means of mathematic methods, the Leontief input-output systems are 
directly treated without converted to general systems. A sufficient condition 
under which the Leontief input-output models are stable is investigated. Based 
on this, the control algorithm is derived. Finally, the code of the input-output 
control program is provided. 


Keywords: ADA, Design, Input-output Model. 


1 Introduction 


ADA is a high-performance language for scientific and technical computing. At first, 
the scalar noninteractive languages, such as C or Fortran, are used to compute the 
scientific and technical problems. But these languages can hardly deal with the matrix 
and vector equations. ADA’s basic data element is an array which does not require 
dimensioning. So ADA can easily solve many computing problems, especially those 
with matrix and vector formulations. Matrix is a mathematic tool which is utilized in 
many areas, such as economics [1]. 

In the past years much attention has been paid for the simulation and control of 
economics. Many economic models were founded to simulate the run of economic 
systems. Most of those models were matrix and vector equations. Then economists 
tried to solve these equations and hoped to control these models. At first, economists 
completed the study of the static economic systems’ matrix equations. However, most 
economic problems are dynamic. Then, the economists have to study the dynamic 
economic systems’ matrix equations. They often investigate the current state of an 
economic system and ask how various policies can be used to move the system from 
its present status to a future more desirable state when they deal with those dynamic 
systems [2]. A number of fundamental notions and methods based on the theory 
of economical cybernetics have been extended. When a more detailed description of 
the production side of an economic organization is desired with the development of 
macroeconomics, this leads to a so-called input-output analysis (i.e. input-output 
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economics). Harvard Professor Leontief, who opened the door to the input-output 
economics, put forward the Leontief input-output model in 1949 [3]. The classic 
Leontief input-output models are static systems with linear matrix formulations. Then 
in the region of input-output economics, many models were established to describe 
the real economics [4]. The general linear models of input-output economics were 
well investigated. However, many systems are singular linear models, which are 
much harder to be solved than general linear models. In the area of cybernetics there 
is a rapid progress in the control of singular systems because such systems can de- 
scribe many real systems such as economic systems [5]. Singular models in econom- 
ics are generally converted into general linear models by means of the selection of 
state vector, control vector and output vector. We hope to research the structure of the 
singular Leontief input-output model’s matrix formulation and find out a control algo- 
rithm of this kind of model. Based on this algorithm, the simulation and automatic 
control of the Leontief input-output model will be completed [6]. Furthermore, the 
classic Leontief Input-output Model does not consider the economic architecture’s 
change. The coefficient matrix of classic Leontief Input-output Model, which reflects 
the quantity of investment in each sector, is static and can not correctly describe the 
variety of economic system when the architecture of economy has changed. So, 
the form of coefficient matrices of Input-output Model can be improved to describe 
the variety of economic system’s architecture. 

Different program packages can be used in corresponding areas, such as math, con- 
trol theory, economics. A new input-output model toolbox need be designed for the 
research of the Leontief input-output model. In this paper, ADA is used to design 
such toolbox. 


2 Simulation of Leontief Model 


Prior to the design of the input-output model package, we must confirm the function 
of this toolbox. Firstly, this toolbox should be able to solve the equations of the Leon- 
tief input-output model and simulate the run of the Leontief input-output model. 
Secondly, the toolbox could automatically search a stable solution of the Leontief 
input-output model when we make an economic plan. 

Now we will firstly investigate the equations of the Leontief input-output model 
and simulate the trajectories of input and output. The classic economic dynamic input- 
output model is described by 


x(k) = Ax(k) + BLx(k +1) —x(k)]+ ¥(k). (1) 


The vector x(k) =[x,(k)...x, (k)]’ € R” is the total output vector and x,(k) is the 
total output from sector i. ¥(k) =[y,(k)... y,(k)]’ is the final net product vector and 
y,(k) denotes the final net product of sector i. The matrix A=[a,] is the direct 
consumption coefficient matrix, B=[b,] is the capital coefficient matrix. In fact, 


Matrix A and B do not maintain unchanged in the real economic life. Generally, 
the average profit margins vary from sector to sector: a higher one for the sector 
whose demand exceeds supply and a relatively lower one for the sector oversupplied. 


Computer Simulation of the Leontief Model in ADA 395 


Consequently, an enterprise with a low profit margin would invest capital in a high- 
profit sector and thus the economic structure is changed, which means the change of 
the coefficient of matrix A and B. Manifested in the input-output model, matrix A and 
B keep changing every year and the value of matrix A and B is determined by supply 
and demand relations. The allocation of investment is described by capital coefficient 
matrix B. If the investment in sector j is raised, the corresponding coefficient of the 
jth row of matrix B will be increased; if the investment in sector j is reduced, the 
corresponding column coefficient of matrix B will be decreased. Assuming that Bc(k) 
is an elementary matrix, when the investment in sector i is raised, element on Row i / 
Column i is constant Pi which is greater than 1, and when the investment in sector i is 
reduced, element on Row i / Column i is constant Pi which is less than 1. Then the 
change of capital coefficient matrix could be described via the equations: 


B(k) =B:- Bc(k) and Pi=1+(P(i)— Paverage) . (2) 


Matrix B is the initial capital coefficient matrix; Constant P(i) is the profit margin of 
sector i; Paverage denotes the average profit margin. Then the above dynamic input- 
output model can be rewritten as 


B(k)x(k +1) = (1 — A+ B(k)) x(k) —Y(k) (3) 
where rankB(k)=r<n. 


The equation (3) is a difference equation. Matrices A and B can be determined by 
economic policy makers. Then the next question to be solved is the value of Y(k). In 


fact, Y(k) can be considered as the control vector of economic discrete-time singular 
dynamic input-output model because we can affect the quantity of final net product by 
controlling the scale of investment. Then Y(k) can be treated as control vector which 
can be specified by governments or companies. Thus the equation (3) can be solved. 


Remark 1. In this paper, R” is the n-dimensional Euclidean space; R””” denotes the 


mxXn real matrices space; J is the nxn identity matrix; A’ denotes the matrix 
transposition; “*” is used as the term that is introduced by symmetry; when matrices 
X,Y are symmetric, X < Y means that matrix Y—X is positive-definite. 


Remark 2. By using of the function “dsolve(Q”’, we can get the solution of equation 
(3). Of course, we can also design the new function to solve the equation (3). The 
following code is a one of such a function. 


procedure read_abc is 
type big is new real; 
d,y11,y21:real; 
b11,b12,b21,b22,a11,a12,a21,a22:big; 
1: integer 
for i in 1..100 loop 
b11*dx+b12*dy=al1ll1l*x 
b21*dx+b22*dy=a21*x 
end loop; 
end 


— 


)+al2*x(2)-y11; 
)+a22*x(2)-y21; 


— 
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We can get the solution of the equation (3) by chase method and recursive invocation 
of the above procedure. 


In this section, we describe the design of simulation of the Leontief input-output 
model. When we can know the values of matrices A, B and Y(k), the solution of 


Leontief input-output model can be easily derived. Figure | shows one example of 
Leontief input-output model. 


3 Stable Solution of Model 


In real society, there are many problems concerning the control of the economic sys- 
tems’ scale. One important control problem is about the quantity of final net product. 
Generally, if the final net product in the first year is d,, it is hoped to increase to a 


higher level several years later. However, the quantity of the final net product could 
not increase forever because resources on the earth are scarce. Scarcity means that 
society has limited resources and can not allocate all the resources to one sector. So it 
is hoped that the quantity of the final net product remains at a quite stable level. So, 
we need to research the stability of the input-output models. In economics, capital 
coefficient matrix B is not always invertible, because the product of some sectors can 
not be treated as capital product and applied to invest. Then the system (3) is a singu- 
lar input-output model. The singular systems are hard to handle. Fortunately, the sta- 
bility of singular system (3) is equal to the stability of the following system: 


Bk) x(k +1) = Ud — At B(k)) x(k) (4) 


where B(k) = B- Bc(k). So the research of the stable solution of singular systems (3) 
can be converted into the research of system (4). To investigate the stability of system 
(4), the following definition of stability is needed: 
Definition 1: The discrete-time singular equation: 


B- Be(k)x(k +1) = (1 — A+ B- Be(k)) x(k) (5) 


(1) System (5) will be called regular if det(sB, —B,—I+ A) is not identically zero 
where B, = B- Bc(k). 

(2) System (5) is causal if deg(det(sB, — B, —I + A)) = rankB, where B, = B- Bc(k). 
(3) System (5) is asymptotically stable if any root of det(sB, — B, —I+ A) =0 lies in 
the interior of the unit disk with center at the origin where B, = B- Bc(k). 


(4) System (5) is called to be stable if it is regular, causal and asymptotically stable. 
In general, it can be assumed that the quantity of final net product rely on the quantity 
of total output product. In other words, we assume that the quantity of final net prod- 


uct can be described by Y(k)= gx(k) . Then the system (5) turns into the form of 
B- Bc(k) x(k +1) =(1 — A’ + B- Bc(k)) x(k) (6) 


where A’ = A+ g . Next, we need to research the stability of system (6) and the 
value of g. This problem can be solved with the following algorithm. 
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Theorem 1: The singular dynamic input-output system (6) is stable if there exist 
matrices P>0O, Q and S, with proper dimensions such that the relations 


r+ zr" <0 (7) 
and B/S, =0 (8) 
can establish for each k where 
B, = B-Bc(k), 


1 1 ’ , , , ~ 
n= P+ A"PA PA’ + PB, — A” PB, + QS," —QS,' A+ QS,'B,. (9) 


Proof of Theorem 1: For system (6), there exist two nonsingular matrices M, and 
N, such that 


B,=M 1 O)7, 10 
kek Q g k (10) 


where J, € R’“*«*""*% ig an identity matrix. 
Then, pre-multiplying (J—A’+B,) by M," and post-multiplying (J —A’+B,) 


by N,' one can get a new matrix: 


T, T. 
Mi"=8' BN =| 1k 1 
Ty, Ty, 4 ad 1) 
: , qT, -f, Ty, 
It can be written as: A=I-M, N, - 
Ty, Ty 
Obviously, S can be: 
ey i 0 
S,=M, : (12) 
I, 
Then we can write 
P, P 
p=m,"| : Ta o=m| Oe |. (13) 
Py Py 2, 
where the block is compatible with that of (10). 
Together with (9)-(13), (7) is equivalent to: 
X+2! = P—PA’— A” P+ A” PA’+ PB, — A” PB, +B,’ P 
—B," PA’ + QS" — QS" A’+ QS" B, + SQ’ —A™ SQ’ +B,’ SQ’ (14) 


where 
_ pT T T T T AT 
Loo _ Ty Py Dy +1, Py Thy +1, Py Ty +1, Pa Tay + QT +T,.Q, i 


From formula (14), we can get Z,, <0. Utilizing the reduction to absurdity, we can 
easily prove that the matrix 7, is nonsingular. Now, define: T, = 1 —A’+B,. 
Then, for system (6) it is easy to see: 


if = hy TT, 


s 
det(sB, —T,) = det(M, 


|wo = det(M,N, pail“ te te ) 


a Ty, "3k Ty, 


Obviously, there exist two nonsingular matrices L and R such that 
o58 Pe ]e-|" =i a 
Ty Ty -T,, -I, 


1,-T, —T; 
So, it can be easily seen that tes ec *} is not identically zero and the 
"3k an 
degree is rankB,. For M,N, L, and R are nonsingular, det(sB,—T,) is not 
identically zero and deg(det(sB, —T,)) = rankB, . So this discrete-time singular input- 
output system (6) is regular and causal. 
Next, we will prove that system (6) is asymptotically stable. Since system (6) is 
regular and causal as proved above, according to [8] can find two nonsingular matric- 


es M and WN such that 


Pua fb sre os : ete SOr|| 
B, =M, 0 Gir ere 0 1, N,. (15) 
Then, S can be chosen as 
~ | 0 
S=M, ‘ (16) 
I, 
Define 
gle! P| . 
p= u,"| eh 2 ie O= Ne Fa (17) 
Py, Py, 2, 
Substituting (15)-(17) into (7), we can acquire 
ce T,Py, + Ox, Js <0 
Keo eT yes Pe es es k ° 
ans +01 Py +O, +O; 


Using the Schur complement formula, we have TPT. -P, <0. From [9, 19], we 
know that every root of det(sB, —T,) =0 lies in the interior of the unit disk with 


center at the origin. So system (6) is asymptotically stable. Thus system (6) is regular, 
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causal and asymptotically stable which means system (6) is stable. This is the end of 
the proof. 


Remark 3. Theorem 1 offers a sufficient condition for the discrete-time singular dy- 
namic input-output system to be stable. Based on this, we can get the stable solution 
of the Leontief input-output model (3). The following algorithm is such a result. 


4 Conclusion 


The design of the Leontief input-output toolbox in ADA has been provided. This 
toolbox provides the simulation function of the Leontief input-output model and can 
find out the stable solution. One important aspect of our design is the extension of 
classic Leontief Input-output Model. The capital coefficient matrix is designed as 
function of time k. A new algorithm is designed to get the stable solution of singular 
input-output model. Finally the Leontief input-output model toolbox is completed. 
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Abstract. The bearingless synchronous reluctance motor (BSynRM) is a multi- 
variable, nonlinear and strong-coupled system. To solve the difficult problem of 
precise decoupling in electromagnet torque and radial suspension force control, 
the direct suspension force control (DSFC) method is presented with the refer- 
ence of direct torque control based on space vector pulse width modulation 
(SVM), and the direct suspension force control (DSFC) algorithm is deduced. 
The direct suspension force control (DSFC) system, which a double closed-loop 
control system of the rotor radial displacement and radial suspension force is 
constructed to control the radial suspension force directly, is designed and si- 
mulated. The simulation results show the stable suspension of the rotor and 
good performance of the proposed algorithm that realizes the decoupling con- 
trol of electromagnet torque and radial suspension force. 


Keywords: bearingless synchronous reluctance motor (BSynRM); direct sus- 
pension force control (DSFC); space vector pulse width modulation (SVM); 
decoupling control. 


1 Introduction 


A bearingless synchronous reluctance motor (BSynRM) has many special advantages 
and characteristics. Compared with the bearingless permanent magnet synchronous 
motor, the BSynRM without rotor windings and permanent magnetic material can be 
used for high-speed applications including operation at high temperature. The 
BSynRM in comparison with bearingless induction motor is low cost and control 
algorithm of the BSynRM can be simplified because it is not necessary to compute 
the slip. Therefore, the BSynRM has a wide range of applications, for instance high- 
speed machine tool, turbo molecular pump, turbo compressor, flywheel energy 
storage etc[1-5]. 

Radial suspension force of the BSynRM is generated by the magnetic field interac- 
tion between the torque windings and radial suspension force windings, which deter- 
mines between the torque and radial suspension force existing strong coupling[6-7]. 
To ensure the BSynRM stable operation, decoupling control of electromagnet torque 
and radial suspension force is necessary and realizes the dependent control of torque 
and radial suspension force. At present, radial suspension force control algorithm of 
the BSynRM mainly uses the rotor magnetic orientated method. However, this control 
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method exists some weaknesses as follow: (1) this algorithm is difficult to achieve the 
theoretical results due to the complexity of vector coordinates transformation; (2) an 
open-loop control of radial suspension force affects the accuracy and dynamic re- 
sponse performance of radial suspension force control; (3) this control method adopts 
current following inverter, which has high switch frequency and the utilization rate of 
inverter capacity is relatively low. 

With the reference of direct torque control based on space vector pulse width mod- 
ulation (SVM) [8-10], hence the direct suspension force control (DSFC) method is 
presented and the control algorithm is deduced. Moreover, the DSFC system for the 
BSynRM is designed and simulated using Matlab software. The simulation results 
have shown that the rotor can be suspended stability, torque and radial suspension 
forces can be controlled independently and control system has strong robustness. 


2 Mathematical Model of the BsynRM 


The rotor saliency effect results in the magneto-resistance torque. The torque of the 
bearingless synchronous reluctance motor can be written as follow. 


i= = Ps (Woredag aq Varsha) () 
where i,1,, isig are the stator O-axis and B-axis currents of the stator torque windings 
respectively; Yiu, Yip are the stator o-axis and B-axis air gap flux linkages of the 
stator torque windings respectively; py is pole pairs of stator torque windings. 

The mathematical model of rotor radial suspension force is built on the as assump- 
tion that: (1) Assuming the radial displacement of the rotor is small enough to air gap 
length 6,; (2) Assuming the initial value of the space position angle and the initial 
position angle of the rotor are zero; (3)Neglecting the leakage flux of suspension force 
windings. The radial suspension force acting on the rotor a-axis and B-axis can be 
written as follows. 


= 2 
F,=[, ora (—hoosplg 


nee (2) 


F,= > ab Psingdg 
24 


where / is the core stack length, r denotes the rotor radius of the salient poles, uo is 
permeability of vacuum, ¢ is the space position angle, B is motor air gas flux density 
including torque windings and suspension force windings. 

In order to simplify the radial suspension model, so the equation (2) integral divides 
into four integrals over the rotor salient poles[11]. The mathematic model of radial 
suspension force acting on the rotor can be written as follows. 


i = ky Vino cos(A — 1) + ky iniYoo cos(A+ {) (3) 


Fa= Ky WniYoo Si A = 1) + kyu Woo Si(A + 2) 
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nr, 
12N,N,iru, >” 8N,N,IrLy 
flux leakage, yw, is stator flux linkage amplitude of suspension force windings, N,, N> 
are the number of turns which are belong to the torque windings and suspension force 
windings respectively, 4, “ are the initial phase angle of air gas flux leakage. 


where k,,, = »Wmi is the amplitude of torque windings air-gap 


3 DSFC of the BSynRM 


The direct suspension force control (DSFC) method adopts double-loop control, in 
which the inner closed-loop is radial suspension force loop and the outer closed-loop is 
rotor radial displacement loop. According to the increment between given value of the 
radical suspension force and the feedback value of the radical suspension force, incre- 
mental flux leakage of suspension force windings is generated by the DSFC algorithm, 
and then uses SVM principle generates the corresponding space voltage vector to drive 
the voltage source inverter to achieve the suspension force windings and radial force 
flux leakage direct control. 


3.1 Basic Principle of DSFC 


According to the equation (3), the radical suspension force can be divided two parts. 
Fig. 1 shows the vector diagram for DSFC. /-y is the angle between radial suspension 
force vector F,and A-phase windings axis, A+ is the angle between radial suspension 
force vector F,and A-phase windings axis. So, the ky Wn Yoo, Kuo Wn Yoo are the am- 
plitude of radial suspension force vector F, and F respectively. The equation (3) can 
be transformed as follows. 


: = F,cos(A— 2) + F, cos(A + fl) mn 


F,=F, sin(A—“)+ F, sin(A+ 4) 


Radial suspension force depends on the amplitude and direction of suspension force 
windings flux leakage and torque windings air-gap flux leakage. A stable radial force 
of the rotor is required by the control of suspension force windings flux leakage and 
torque windings air-gap flux. The motor electrical time constant is much smaller than 
mechanical time constant, assuming the location of the rotor is stationary within a very 
little time period At, hence the small-signal model can be used to analyze the mathe- 
matic relationship about the radial suspension force, flux leakage of suspension force 
windings and the torque. 

(1) In steady state, the BSynRM operates at rated speed or rated torque. Within time 
At, the amplitude and phase of torque windings air-gap flux linkage are constant, and 
the torque angle dis constant too. So suspension force can be controlled by regulating 
the amplitude y%, and phase / of suspension force windings flux linkage derived from 
equation (3). 

(2) In transient state, torque of the BSynRM is variable or motor speed changes. 
Keeping amplitude y, constant, hence torque control is implemented by regulating 
the torque angled shown from Fig. 1. Assuming torque increases AT, which will result 
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in the decrease of amplitude y,,, and increase of angley, and the rotor radial suspen- 
sion force amplitude and phase angle will vary at the same time derived from 
equation (3). To compensate radial suspension force, it is necessary to increase the 
amplitude and phase of suspension force windings flux linkage. For reducing the 
electromagnetic torque, the situation is vice versa. 


Fig. 1. Vector diagram of direct suspension force control 


3.2. DSFC Algorithm of the BSynRM 


Constant torque operation of the BSynRM can be regarded as a special state of the 
variable torque operation, so control algorithm of DSFC in variable torque operation 
state only is deduced considering the universality of control algorithm. Fig. 2 shows 
vector diagram of rotor radial suspension force and suspension force windings flux 
linkage. 

From Fig. 2, it can be seen vector diagram of two variables during the period from 
t to t+1, which include rotor radial suspension force and suspension force windings 
flux linkage. At t moment, suspension force is F(t). Using small signal model, assum- 
ing torque increase AT, during the period from ¢ to t+1, which makes the amplitude 
Writ) reduce and phase angel 4 change, derived from equation (3). Supposing the 
changed values are y’,;(¢) and “’ respectively; suspension force vector F'y(t) becomes 
F,(t), phase angel A- becomes /-y', suspension force vector F(t) becomes F (1), 
phase angel A+ becomes A+’. 

According to geometry knowledge, the triangle AF\(t+1)OF (0), A Wo(t+ DO wd) 
and AF,(t+1)OF,"(1) are three mutual similar triangles in Fig. 2. So, the angle be- 
tween vector Y(t) and vector F'(f) can be written as follow. 


A-(A-W) =u (5) 


At the same times, the angle between vector y(t) and vector F(t) can be written as 
follow. 


A-(At+ pf) =— (6) 


Assuming kpy=kyiWimi(), kKr=kvwoW mild), the triangle A Y%o(t+1)O Y2(t) is equivalent 
to triangle AF\(t+1)OF,'(t), which takes O as the endpoint with the counterclockwise 
rotation angle yw’, and its sides length are reduced to I/kp,; times as well as is 
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equivalent to triangle AF,(t+1)OF,"(f), which takes O as the endpoint with the coun- 
terclockwise rotation angle -“', and its sides length are reduced to 1/ kp) times. So 
vector A y(t) is equivalent to vector AF;(t), which takes O as the endpoint with the 
counterclockwise rotation angle jv’ and its sides length are reduced to 1/kp, as well as 
is equivalent to vector AF(t), which takes O as the endpoint with the counterclock- 
wise rotation angle -' and its amplitude is reduced to 1/ kg, times. 

Rotating coordinates transformation can be used to calculate the mathematic rela- 
tionship about AF), AF,and A yw. Fig. 3 shows the coordinates relationship increment 
of suspension force AF, AF, and increment of flux linkage AY. So, AF}, and AF ig 
are determined from of components of suspension force windings flux linkage AYou 
and A Yo as follows. 


AF, cos’ sinwl’ \( AYoog, 
di Far pe pes a O 
AF, —sinw’ cos’ )\ AWo, 
Accordingly, AF, and AF>g are determined from a components of suspension force 
windings flux linkage Ay, and A Woz as follows. 


AF, =p cos Ll’ —sin yw’ AW oo0 8 
AF,, Pl sin’ cos’ AWoog (8) 


The error of radial suspension force AF includes AF, the error of radial suspension 
force F, and AF; the error of radial suspension force Fz. So, the AF from of compo- 
nents can be written as follows. 


ie 3 [ (Ke, +kp)cos (kp, — kp) sin ‘) r i 
AW 


(9) 


AF, 


—(ky, — kp) sin uw (Kp + Kp) COS uw 


ee Vo(t+1) 


Fig. 2. Vector diagram of suspension force Fig. 3. Connected vector diagram of incre- 
and flux linkage ment of suspension force AF,, AF and incre- 
ment of flux linkage Ay 


3.3. DSFC System Design 


Fig. 4 shows the block diagram of DSFC system. It can be seen that torque windings 
flux linkage calculator can real time calculate torque windings air-gap flux linkage 
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amplitude y,,, and phases. The increments of rotor radial displacements are regulated 
by PID controllers to produce the command values of radical suspension force Fy, 
Fg. Suspension force windings flux linkage estimator calculates the amplitude yx 
and phase, which can work out the feedback value of radical suspension force Fo, Fz 
by combining with the amplitude y,,; and phasey. Then the increment of suspension 
force windings flux linkage AY%2_,and A Yop can be deduced by equation (9), accord- 
ing to the increments between radical suspension force Fy, Fz and its command values 
Fy, Fs. Finally the appropriate voltage space vectors are selected for suspension 
force windings, and voltage source inverter switching signals are generated by SVM. 


4 Simulation Results 


Control system of the BSynRM including torque control subsystem and radical 
suspension force control subsystem is constructed. Control strategy of torque control 
subsystem adopts the DTC based on SVM. DSFC algorithm is used to direct radical 
suspension force control subsystem. In order to verify the DSFC algorithm, the 
schematics shown in Fig. 4 can be implemented and simulated in Matlab6.5-Simulink 
environment.In this system, variable step size selects ode23t, and simulation time 
selects Os to 0.1s. Simulation main parameters are given as follow. 

Motor parameters: rotor quality m is 1kg, rotational inertia J is 0.002kg-m’, air-gap 
& is 0.25mm, and auxiliary machinery bearings 6, is 0.20mm. Torque windings: rated 
voltage is 240V, rated current is 4.8A, and pole pairs py is 2, stator resistance R, is 
0.25Q, direct axis inductance Ly is 35mH, cross axis inductance L, is 4.2mH. Suspen- 
sion force windings: pole pairs pz is 1. 


A Vooe 


SO PID BO aha ie \ 
= n-yy ‘ F, at SVM >| motor | 
Yeo] pip ee p| / 
ti, Aap 
a Suspension 


Suspension }t force windings 


force Ws] flux linkage 
calculator ‘*#——J catculator 


Yuk HA Torque 
windings flux 
linkage 


calculator 


Torque control(SVM-DTC) 


Fig. 4. Block diagram of direct suspension force control system 


The motor load torque has been set to 1N-m, speed is 1500r/min at starting, then 
added to 1800r/min at t=0.03s, the Fig. 5 (a) shows the speed output characteristics, 
speed overshoot is under 0.4%, and the fluctuating error of speed in steady state is 
less than 10r/min. The speed has been set to 1500r/min, motor load torque is 1N-m at 
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starting, then added to 3.5N-m at 0.07s. Fig. 5 (b) shows the performance of the tor- 
que, the pulsating movement of torque is less than 10%. Motor torque control system 
has good speed performance shown in Fig.5 (a)-(b). 


006 008 01 9020 


oe times rime? 9 0.08 on 


(a) Speed output characteristics (b) Torque performance 


Fig. 5. Simulation results of the torque control system 


The starting position x-axis offset is-0.15mm, y-axis offset-0.10mm; Fig. 6 (a) 
shows the radial displacement in x axis; Fig. 6 (b) shows the track for the mass center 
of the suspended rotor. Although at t=0.07s step change of load torque from 0 to 
3.5N.m, the radial displacement cannot be interfered by the load torque, indicating 
DSFC control algorithms realize electromagnetic torque and radial suspension force 
decoupling. Fig. 6(c) shows the dynamic response curve of x, y-axis radial displace- 
ment, with step change of x-axis from -0.15mm to 0.06mm at t=0.06s and y-axis from 
-0.1mm to -0.04mm at t=0.04s. It can be seen that the control system has good de- 
coupling performance, torque and rotor radial suspension force can be controlled 
independently shown in Fig. 6 (a)-(c). 


x-axis radical displacement/mm 


0002 «O08 008 008 00s 006 
‘times time's 


(a) x axis radial displacement (b) Rotor suspend (c) radial displacement response curve 


Fig. 6. Simulation results of the control system 


5 Conclusions 


Based on the relation of flux linkage and suspension force of the BSynRM, the direct 
suspension force control (DSFC) method is proposed. The ideas of DTC are applied to 
a BSynRM system to control suspension force directly. The control algorithms of 
DSFC are deduced, and the simulation models of control system are established in 
Matlab-Simulink. The simulation results have shown that the method is valid, and the 
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control system has a good static and dynamic performance. The rotor can be suspended 
stability, torque and radial suspension forces can be controlled independently. 
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Application of Wireless Sensor Network for Monitoring 
Water Temperature Difference in Blast Furnace Cooling 
System* 


Yuefeng Liu and Jie Jiang 


Information Engineering Institute, Inner Mongolia University of Science & Technology, 
Baotou, Inner Mongolia, China 


Abstract. The wirless sensor network is a new kind of technique for getting or 
processing information. It’s a new try for WSN with application to the blast 
furnace cooling system. This paper initiate the accomplishment of sensor nodes. 
And describe the construction and design of monitoring system. The system can 
monitor the water temperature difference of a blast furnace cooling system in 
real-time, and provide the blast furnace with basic data for its safety production. 


Keywords: Wireless sensor network; blast furnace; cooling system; water 
temperature difference. 


1 Introduction 


At present, many large or middle blast furnace in china universally used the direct 
labor method monitoring water temperature at regular time. It’s very dangerous to 
the workers because the blast furnace will leak gas everywhere at last. Sometimes the 
furnace jar had not been heated evenly during the blast furnace metallurgy, and the 
local edge gas stream bloat excessively so that water temperature increased rapidly. If 
this phenomenon can’t be found in time, it will bring about the fatal accident such as 
furnace jar or furnace body may be burned to penetrate or explode and so on[1][2]. 

The shortcomings of direct labor monitoring are more workload, not in time and 
unsafe. Paper [1] and paper [2] proposed that using the DCS (Distributed Control 
System) to solve this problem, and effectively bridge the gap of direct labor method. 
But the sensor signal is transmitted by long pull wires, these wires will be damaged 
inevitably in the high temperature, so the maintain cost will be very high. Further- 
more, temperature measuring points were distributed broadly and multitudinously, 
this made the wire layout very elaborate and installation cost also very high. There- 
fore, this paper mainly discussed how to accomplish the application of 
WSN(Wireless Sensor Network) for monitoring water temperature difference in blast 
furnace cooling system. 
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2 Wireless Monitoring Technic for Cooling System Water 
Temperature Difference 


The temperature sensor uses bridge mode Pt! 000 thermistor of 1/3DIN grade, piping 
collection design used the standard pipe reed, installation and maintenance is very 
convenient [2]. With our own developed sensor nodes collect the temperature of every 
points, then every node send the data as particular rule to its own cluster head in the 
cluster. The cluster head collect all the data of sensor nodes in its own cluster, and 
then send these data to the relay node. Finally the relay nodes send the data to sink 
node in the operating room. See the structure in Figure 1. 


____ Cluster 2 


ClusteL \ eo 
e@ 
\ 4 Sink node 
\ 4. 
eo 4 ae 
\ @ os | 
7 an ee - wt 
i ; = a 7 @ Sensor node 
Cluster OF i. @ / 
matt See e@ oe @ Cluster head node 


@ 
Fig. 1. System Structure of Monitoring Temperature 


WSN can not only decrease the costs of installation and maintenance, but also can 
share some function of multiple path control unit under the system of DCS. 16 path of 
sensor signal are collected in one control unit before, it is changed now as the mode 
that every sensor node acquire the signal each other so that the data is more accurate 
and collected timely. According to each sensor node is a small processor, it has 
enough ability to compute and process. In this view, compared with DCS, WSN is a 
distribution system with strong fault tolerance. 


2.1 Application Requirement Analysis 


1) Water temperature measuring precision is +0.05°C; 

2) Temperature resolution ratio is 0.01°C; 

3) There have been distributed more than 300 temperature measuring points in the 
area of 200 meter long, 20 meter width and 30 meter high around the blast furnace; 

4) The distance between sensor nodes and operating room is 250 meter the most far 
and 50 meter the nearest; 

5) Sensor node can work continuously for five months with four batteries. 


2.2 System Sensing Reliance 


System sensing result is based on the water temperature difference between the inner 
water and outer water in the cooling wall and the data of thermal stream strength, and 
finally is calculated by the administration computer. 
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Thermal stream strength: 


J=(VxAtxCx1000)/S (1) 


In the formula above, J is thermal stream strength; V is current velocity; At 
( At=t_out—t_in ) is water temperature difference, t_out , t_in is outer 


water temperature and inner water temperature; C’ is water specific heat; S' is cool- 
ing wall area [1]. 


2.3 Water Temperature Sensing Operational Principle 


Constant-current source provide the electric current to bridge mode temperature sen- 
sor, and then invert the temperature variation of medium touched by sensor to the 
corresponding output voltage variation. Then the analog signals are sent to the A/D 
converter, amplified, converted and digital filtered. The measuring temperature data is 
sent to administration computer through WSN after linearization processed by single 
chip microcomputer. 


2.4 Water Temperature Sensing Error Analysis 


The temperature measuring error is very important to system. It is the key that how to 
reduce and calibrate error. The error is related to the grade of Pt thermistor and the 
nonlinearity of bridge circuit. The measuring bridge circuit is the signal convert link 
in the system, whose error is more important in the error of whole measuring system. 
Even using the Pt1000 sensor whose grade is A and allowed error is +0.15°C, the error 
is also beyond the demand of system sensing precision. The Pt thermistor error be- 
comes the main part of measuring error. So the system chooses the special Pt1000 
which its grade is 1/3DIN, in order to reduce the sensor error. Besides, the system 
software also correct the nonlinearity of bridge circuit. According to “Industry Pt 
Thermistor Separated Table”, using the standard resistance box calibrate temperature 
parameter in several section, and make it linear. Temperature measuring system send 
the temperature data after it is linear processed in several section and with multiple 
junction to the administration computer which shows these data. This intelligent func- 
tion can further improve the measuring precision. 


3 Sensor Nodes Design 


In this paragraph we design the hardware device of sensor motes, and emphasis the 
realization that how to sense water temperature. The sensor node structure is illu- 
strated by the Figure 2[3]. 

The key of sensor node design is that the sensor is very hard to reach the demand of 
water temperature sensing precision. In china some blast furnace water temperature 
difference is demanded between 0.3°C and 0.4°C. It is proved in practice that the 
system could be normal applied in blast furnace only water temperature sensing preci- 
sion reach to 0.05°C [1]. 


412 Y. Liu and J. Jiang 


TOCCSSOL 
node | 

1 |] ATi@Ga fi 
Le — ze |) i ; = 
SENSOR mT H i ATTaDo ie Met 7) WAC > C1100 | 
i M1 I 


Sensor mode! fireless commnication model 


Power model 


Fig. 2. Wireless Sensor Mote Structure 


3.1 Wireless Communication Model 


Radio chip chooses Chipcon’s CC1100. CC1100 is a low cost true single chip UHF 
transceiver designed for very low power wireless applications. The circuit is mainly 
intended for the ISM (Industrial, Scientific and Medical) and SRD (Short Range De- 
vice) frequency bands at 315, 433, 868 and 915 MHz. Programmable data rate is up to 
500 kbps. It can improve the transmission capability by using inner antecedent error 
correction, thus can be applied in the wireless ugly surroundings “!. 


3.2 Processor Model 


CPU chooses the ATMEAL’s ATMEGA128. This microprocessor has abundant on- 
chip source and interface. 


3.3 Sensor Model 


The sensor is in charge of the function sensing inner water temperature or outer water 
temperature of measuring point. The sensor modular circuit is illustrated by Figure 3. 
The temperature signal acquired by sensor is converted through stable bridge circuit, 
amplified by the differential input circuit which consists of interface chip MAX492, 
transmitted to ADC1 interface of ATMEGA128, so thus the final temperature data is 
collected. The reason why analog signal do differential input is that it has very high 
common-mode rejection ratio for even times harmonic wave, it can improve the cir- 
cuit function. There are also other advantages with differential input. First, it has high 
common-mode rejection ratio for noise signal cause by power supply and ground; 
Second, it has strong restraint function for in-phase signal cause by original vibration 
feedback. 

The sensor of this system uses the bridge mode Pt1000 thermistor whose grade is 
1/3DIN. Pt thermistor is a temperature sensor used very widely. Because of the nonli- 
nearity of Pt thermistor, it must be corrected and calibrated in application. So we design 
a linearity calibration algorithm against the nonlinearity between temperature and elec- 
trical potential. Calibration uses segmentation linearity algorithm (segment every ten 
degree), improve the precision of measuring temperature; We also design a filter algo- 
rithm about the interfering signal. First continuously collect 16 instantaneously tempera- 
ture data, then sort these data from low to high, for expel the interference data as 
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Fig. 3. Sensor circuit 


possible, we respectively detract the biggest 3 number and the smallest 3 number, then 
figure out the average of the middle 10 number. It is practically proved that this filter 
algorithm can guarantee 2 numbers follow the decimal point keeping stable. 


4 System Structure and Design 


Blast furnace monitoring system for water temperature difference in cooling system 
consists of sink node, relay nodes, cluster heads and sensor nodes. See the structure in 
Figure |. There are more than 300 sensor nodes, each cluster contain about 30 sensor 
nodes, sensor nodes communicate with sink node by cluster head and relay nodes. 
Every sensor node is assigned the cluster head number and index. This can assure the 
unique identity. Sensor nodes cyclical send the data to cluster head with TDMA 
mode, then the cluster head send data to sink node through the relay nodes, different 
cluster work under different TDMA mode. This can avoid channel collision. The 
following declaration is about the function of sink nodes, cluster head and sensor 
nodes. 
The function of sink node is: 


1)collecting the cluster head data; 

2)maintaining the cluster heads time synchronization; 

3)decreeing (user command, administrator command) to cluster heads; 
4)sending data to administration computer; 

5)accepting the order of administration computer. 


According to the received data, the client judges whether the temperature is normal, 
whether the battery voltage down to the critical point, and whether need to alarm. The 
administration computer also has the function of data backup and data query. 

The function of cluster head is: 


1)accepting and confirming the sensor nodes data; 
2)maintaining the time synchronization of sensor nodes in its own cluster; 
3)sending data to sink nodes. 
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The sensor nodes are located in every part of blast furnace. Its main function is to 
acquire the temperature data at regular time and send the data to cluster head on the 
basis of regulation time-slot. 


5 Conclusion 


The costs of installation and maintenance are very low, the precision of measuring 
temperature is very high, and the expansibility of sensor nodes function is very strong. 
It can fully attain to the demand of production technology, and it has practical sense 
in guidance of operating blast furnace, assuring the production safety and prolonging 
the furnace life and so on. 
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Abstract. The digital control technology of a bearingless permanent magnet 
synchronous motor (BPMSM) includes torque control technology and radial 
suspension force control technology. On the basis of explaining the producing 
principle of radial suspension forces, electromagnetic torque equation and radial 
suspension force equations of the BPMSM are given. Based on the functional 
block diagram of the BPMSM control system, optimized software structure of 
the control system is designed, realizing methods of its various functional 
blocks and management mechanisms of two kinds of critical resources in 
TMS320LF2407A DSP are put forward. The experiment results show that, 
adopting this software structure and the realizing methods and management 
mechanisms, the functions of the BPMSM torque control, suspension control 
and human-computer interaction can be implemented well. The software struc- 
ture and realizing methods and management mechanisms have important refer- 
ence value for developing digital control of bearingless motors. 


Keywords: bearingless motor; permanent magnet synchronous motor; torque; 
suspension; software structure; critical resource. 


1 Introduction 


A control system of bearingless permanent magnet synchronous motor (BPMSM) 
includes torque control subsystem and radial suspension forces control subsystem. 
The electromagnetic torque is produced by torque windings. The speed signal is 
gained by using rotary encoder or adopting sensorless operation technology, and then 
motor speed can be accurately controlled [1-2]. The radial suspension forces are pro- 
duced by suspension force windings. Through adding control current of suspension 
force windings, the generated magnetic fields are superposed to the magnetic fields of 
torque windings, unbalanced magnetic fields are produced and then radial suspension 
forces are generated. When the difference between the pole pairs of torque windings 
and suspension force windings is 1, single radial suspension force in one direction can 
be generated [1-3]. The rotor radial displacements can be measured by virtue of capa- 
citive displacement sensors or eddy current sensors [4-5]. The rotor can be suspended 
at the balance position by the closed loop control of rotor radial displacements [3]. 
Adopting digital signal processor (DSP), the rapidity requirement of response time 
can be satisfied for controlling rotor radial displacements [6]. 
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In the paper, based on setting up electromagnetic torque equation and complete 
radial suspension force equations of the BPMSM, optimized software structure of the 
control system is designed, realizing methods of its various functional blocks and 
management mechanisms of two kinds of critical resources in TMS320LF2407A DSP 
are put forward., software program of the control system is then developed. The expe- 
riment results demonstrate that the functions of the BPMSM torque control, suspen- 
sion control and human-computer interaction can be implemented well. 


2 Production Principle of the Radial Suspension Forces 


In order to produce controllable radial suspension forces, the pole pairs relationship 
between torque windings Py, and suspension force windings Pg should be Pg=Py+1 
[7]. Fig. 1 shows the production principle of the radial suspension forces. Additional 2- 
pole suspension force windings N, and N, are wound in the stator slots together with 
conventional 4-pole torque windings N, and N,. The radial suspension forces can be 
produced by the unbalanced magnetic flux density in the airgap caused by the interac- 
tions between 4-pole excitation magnetic fluxes ®, of permanent magnets and the 
magnetic fluxes generated by 2-pole suspension force winding currents i, and i,. For 
example, if the positive suspension force winding current i, exits in M, winding as 
shown in Fig. 1, the 2-pole magnetic fluxes ®, are generated. Therefore, the magnetic 
flux density increases in the airgap | while decreases in the airgap 3. Radial suspension 
force F is generated in the positive direction of x-axis. If the suspension force winding 
current is negative, a radial suspension force can be produced in the negative direction 
of x-axis. If currents exist in the M, winding, y-axis direction force can be produced. 


stator 


airgap 


\ , 4-pole 


windings 


ix, 2-pole 


/——9 windings 
AN ty 
permanent ° 


magnet B permanent 


4 magnet A 


Fig. 1. Production principle of the radial suspension forces 


It can be seen that the paths of 2-pole fluxes ®, pass through the permanent magnets 
A and B. It is well known that the permeability of permanent magnets is approximately 
equal to permeability in the air. Therefore, the thick permanent magnets result in large 
2-pole MMF (Magnetomotive Force) requirements to produce radial suspension force. 
Consequently, thin permanent magnets are preferred to generate radial suspension 
force efficiently. However, thick permanent magnets have advantages to achieve 
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reasonable magnetic flux density for motor performance as well as to avoid demagne- 
tization. So there exists an optimum thickness of permanent magnet to produce radial 
suspension forces most efficiently. The relationships between the radial suspension 
forces and permanent magnet thickness are described in the next section. 


3 Mathematics Model of the BPMSM 


In this paper, a two-phase machine model is used for simplicity, though a three-phase 
is practical. The current and magnetic flux linkage of the BPMSM are shown in a syn- 
chronous rotating reference coordinates. The relationship among the radial suspension 
forces and the currents of suspension force windings can be expressed as 


fs = (Ky + K,)- (aa Wa +iy, YW) (1) 
Fy =(K, + Ky): (i, Wa ~ loa Wi) 

where, Fj,, Fiy are the radial suspension forces which are made up of Maxwell forces 
and Lorentz forces. Ky is Maxwell forces constant. Ky, is Lorentz forces constant. i2,, 
ing are current components of suspension force windings. Yjg, Wi, are the airgap mag- 
netic flux linkages components of torque windings. 

In addition, according to the theory of Electromagnetic Field, when the rotor is out 
of the center, another radial suspension force will be generated. This effect is known as 
the magnetic tensile force in the electromagnetic field of electrical motor. The generat- 
ed Maxwell forces F;,, F’, are in proportion to the displacement. These inherent forces 
can be written as 


F,. = kx 
i = k, y @) 
2, 
where, k, =k- mrlB , k, is the force-displacement coefficient. 4 is the vacuum per- 
Ho 


meability. 6 is the airgap length. k is the attenuation factor, k~0.3. 
So the radial suspension forces F, and F’, in x- and y- direction can be expressed as 


F. = F. 27 F. 
(3) 


aes ae 
Substituting expression (1), (2) into (3), so expression (3) can be written as 


[ =(KytK,)- Ga Yu +h, ‘YW, +k, “x 


ee * 4 
FE, =(K, Ky): (og Yaa by Wi )+k,y 


When Pg=Py, +1, expression (4) can be written as 


: =(Ky+K) Gu Wa ti, : Y,)+k, x 


' ; 5 
Fo =(K, + Ky) Gy Ya —ing Wy )tk,-y - 
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When Pg=Py -1, expression (4) can be written as 


F.=(Ky-K,)- Goa Ya ti, °W,)+k, “x (6) 
F =(K, -Ky)-G, Wa mba ‘YW, )+k, y. 
The stator magnetic flux linkage equation is as follows 
Wig = Lyi +V, 
ae (7) 
Wi, “ qq 


where, Yq and y, are all airgap magnetic flux linkages. y, is rotor magnetic flux 
linkage. Lz and L, are the self-inductances of torque windings in the 2-phase rotating 
coordinates, respectively. 


4 Control System Structure of the BPMSM 


Functional block diagram of the BPMSM control system software is shown in Fig. 2 
(when Pg=Py+1). In this diagram, the control method, that is, adjusting voltage by 
enhancing magnetic flux is adopted in speed loop, X;ef and yrep are the given rotor radial 
displacements in the x-axis and y-axis. w is the command value of angular velocity. 0, 
denotes the initial phase angle between A-phase axis of torque windings and x-axis, 0, 
denotes the initial phase angle between A-phase axis of suspension force windings 
and x-axis. 
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Fig. 2. Functional block diagram of the BPMSM control system software 


The equation of PARK inverse transformation of the currents of suspension force 
windings can be written as 


baleen +A0,) —sin(P,at+A6, + AO, ) la 


sin(P,at+A0,+A@,) cos(P,att+ AG, + AG, ) ®) 


iq 
where, « is the rotor’s actual angular velocity, ¢ is the time, A@, is the space mechani- 
cal angle among torque windings, suspension force windings and x-axis, A@> is the 
advanced electrical angle of torque windings airgap magnetic field. When Pg=Py+1, 


Digital Control Technology of Bearingless Permanent Magnet Synchronous Motor 419 


AO, = P, (0, —9,) +6, = P.O, —(P, 1), = P,0, — P,4, (9) 


When A-phase axis of torque windings and A-phase axis of suspension force wind- 
ings superpose, namely, 0;=0,=0, then 


AO, = P,0, — P,6, = P,O— P,0 =(P, —P,)@=+0 (10) 


The control system hardware of the BPMSM mostly includes four parts, that is, DSP 
controller, power amplifier, BPMSM interface circuit board and computer system. The 
functional block diagram of the control system hardware of the BPMSM is shown in 
Fig. 3. In this diagram, i*y and i*g are the modulated given currents of torque wind- 
ings and suspension force windings, i y and i, are the demodulated given currents of 
torque windings and suspension force windings. iy and ig are the practical currents of 
torque windings and suspension force windings. 
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Fig. 3. Functional block diagram of the BPMSM control system hardware 


In the control system, the functional modules are as follows: OBPMSM; @rotary 
encoder; @adjusting circuit of rotary encoder’s signals; ® quadrature encoding unit; 
®rotor radial displacement sensor; @adjusting circuit of rotor radial displacement 
sensor’s signals; @ A/D convertor; ® processing, computing and controlling program; 
@PWM signals generator; (demodulating circuit of signals; generating circuit of 
power switch control signals;@2 commutator, filter and inverter unit; malfunction 
control logical circuit; @current sensor; @adjusting circuit of current sensor’s sig- 
nals; (human-computer interaction interface. 


5 Control System Software of the BPMSM 


The equations of PARK inverse transformation and CLARK inverse transformation of 
torque windings’ currents adopted in the BPMSM control system in the paper are as 
follows 
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ina | _ cos(P,@t) —sin(P,art) | isa hic 
Lie sin(P,@t) cos(P,at) | | iv, 

1 0 
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" = 2 _l v3 : ‘wa (12) 

es ce ae sede 
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The equation of CLARK inverse transformation of suspension force windings’ cur- 
rents can be written as 


; 1 0 
i 
is = 2 _l v3 ' tba (13) 
ae BB i 
Igc 1 3 
PB oe! 


Fig. 4 shows the flowchart of this control system program, from the Fig. 4, the 
control program of motor synchronous rotation includes: calling subprogram of calcu- 
lating sine value and cosine value of feedback rotor position angle, gaining torque wind- 
ings’ current value calculated by speed loop PI(Proportion Integration), calling PARK 
inverse transform subprogram, calling CLARK inverse transform subprogram and 


nr 
<< Reset System > 


Jump to the entrance of resloraten 
and miuialization program 
Disable all mlerrupts 
Disable and clear Watchdog 
v 
Call system iniualizalion program 
Jump ty the entimmce of system 
coiguralion provram 


Initialize 
variables, constants. sofware 
slalus control regisler and coefficients 


¥v 


Initialize receiving decoding vector table 


v 


Initialize Wansnaliuing descriptor table 


Jump le the entrance of syslem 
ImWalizalion preenan 


Call Cl inualizauion prozram 


v 


Call mer configuralion program 


v 


Call PWM module configuration program 


v 


Call retor iniual onenialion program 


Call quadrature encoding and 
caplure unil configuralion program 
Call A/D convertor unit 
configuraiion program 


i —— 


Jump to the entnmee of 
Tham Loop program 


Call the control program of 
moelor syuehronous rolulion, 


Vv 
Call the control program of 
Tulor suspension 


Fig. 4. Flowchart of the BPMSM control system program 
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calling sine wave PWM driving subprogram of torque windings. In this program, the 
sine value and cosine value of feedback rotor position angle are calculated through 
the above-mentioned method of look-up table. The control program of rotor suspen- 
sion includes: getting corresponding sine value and cosine value based on the sum of 
feedback rotor position angle and AO, A@(By default this program runs without 
motor load, A@,=0) through the look-up table method(sharing the same sine table with 
the control program of motor synchronous rotation), gaining suspension force wind- 
ings’ current value calculated by rotor radial displacement loop PID(Proportion Inte- 
gration Differentiation), calling PARK inverse transform subprogram, calling 
CLARK inverse transform subprogram and calling sine wave PWM driving subpro- 
gram of suspension force windings. 

Critical resources which have been used in this program include two forms, one is 
that can be saved and then restored, the other is that can not be restored. The former 
involves accumulator ACC, auxiliary register ARO~AR7, status register STO and ST1, 
and this kind of critical resource is managed through locale information protection 
mechanism. The latter involves TREG register and PREG register of multiplier, and it 
is managed through controlling interrupt mode. Namely, before main program uses 
these resources all the related interrupts are disabled, and after using them, the corres- 
ponding interrupts are enabled again. This management mechanism of controlling 
interrupts makes program execution cause a period of delay. But the delay can be 
reduced to a minimum through designing program structure reasonably and enabling 
corresponding interrupts opportunely. The experiment results show that this manage- 
ment mechanism is viable, and the delay time can be controlled within 10 machine 
cycle under the condition of assuring program execution in the maximum efficiency. 


6 Experiment Analysis 


When the BPMSM operates steadily, the waveforms of given and feedback C-phase 
current of torque windings are shown in Fig. 5 (gained by the output voltage signal of 
eddy current sensor). Fig. 6 is the rotor’s displacement wave in x-direction in the 
BPMSM (through detecting output voltage signal of eddy current sensor’s interface 
circuit). From Fig. 6, it is obvious that rotor can be limited to the central position. 
Because of the factors of mechanism imbalance, and so on, rotor is recurrently di- 
vorced from the central position. However, it doesn’t affect the validity and feasibility 
of suspension control. 
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Fig. 5. Waveforms of given and feedback Fig. 6. Waveform of rotor displacement in 
C-phase current in torque windings. x-direction. 
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7 Conclusion 


This paper presents electromagnetic torque equation, radial suspension force equa- 
tions and functional block diagram of control system of a BPMSM. On these bases, 
optimized software structure of the control system is designed, and realizing methods 
of various control parts and management mechanisms of two kinds of critical re- 
sources in TMS320LF2407A DSP are researched. The experiment results show that, 
this software structure and the realizing methods and management mechanisms can 
satisfy all functions of the BPMSM control and lay a foundation for developing digi- 
tal control of bearingless motors. 
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Abstract. This paper presents a new method which combines empirical mode 
decomposition (EMD) and support vector machine (SVM) together for bearing 
fault diagnosis in low speed-high load rotary machine. EMD is a novel self- 
adaptive method which is based on partial characters of the signal. Vibration 
signal measured from a defective rolling bearing is decomposed into a number 
of intrinsic mode functions (IMFs), with each IMF corresponding to a specific 
range of frequency components contained within the vibration signal. Then cal- 
culate the energy entropy mean of each IMF and normalization motor 
speed(RPM) to construct feature vector to train SVM classifiers. The results of 
application in simulation signal and practical bearing fault signal both show its 
efficiency. 


Keywords: EMD, IMF, SVM, feature vector, fault diagnosis, ball bearing. 


1 Introduction 


Bearing is one of the most important components in machine. Once it has any sort of 
faults, it would lead to serious economic lose. The traditional fault diagnosis tech- 
niques mainly contain demodulation and envelope-based method. But both have 
drawbacks. The first one is that bearing defect characters always perform as non- 
linear and non-stationary. Secondly compared with the complex noise the bearing 
defect characters are very weak[1-2]. They are usually submerged in the noise signals 
and hard to be picked out. Finally, by using envelope-based method, it suffers from 
the drawback of having to determine a proper filtering width, in order to obtain con- 
sistent results under varying operating conditions. Therefore, it is quite difficult to 
extract the defect characters just by using traditional techniques. 

Recently, a new signal analysis method, namely empirical mode decomposition 
(EMD) developed by Huang et al., has been based on the local characteristic time 
scale of the signal and can decompose the complicated signal into a number of intrin- 
sic mode functions[3]. By analyzing each resulting IMF component that involves the 
local characteristic of the signal, the characteristic information of the original signal 
could be extracted more accurately and effectively. In addition, the frequency compo- 
nents involved in each IMF not only relates to sampling frequency but also changes 
with the signal itself; therefore, EMD is a self-adaptive signal processing method that 
can be applied to nonlinear and non-stationary process perfectly. 
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In this paper, EMD is applied to the ball bearing fault diagnosis. The original 
acceleration vibration signal is decomposed by EMD and some IMF components are 
obtained, then the concept of energy entropy mean is introduced, which defined by 
calculating the mean value of the vibration signal entropies of a bearing with a fault in 
different various speeds a loads. Then we use the IMF energy entropy mean as an 
input of SVM classifiers to identify the bearing faults. 

The theories of EMD and SVM are introduced in Section 2. Section 3 proposes the 
method combined EMD and SVM to bearing fault diagnosis. Section 4 is an experi- 
mental verification of the proposed method, as it is engaged in a bearing fault diagno- 
sis. The conclusions are given in Section 5. 


2 EMD Algorithm and SVM 


2.1 EMD Algorithm 


The EMD method is a fully data driven approach. Since the decomposition of the 
EMD is based on the local characteristics time scale of the data, it is applicable to 
nonlinear and non-stationary processes. The EMD decomposes into a sum of IMFs. 
An IMF is a function that must satisfy the following two conditions [4]. 


i. The number of extrema must either be equal to, or at most differ by one from the 
number of zero crossings. 

ii. The mean values of both the envelope defined by the local maxima, and the 
envelope defined by the local minima, are zero at any point in the data. 


The sifting process is defined by the following steps: 


Given a signal x(t), the effective algorithm of EMD can be summarized as follows: 
1) Determine all the local maxima, and local minima from the signal x(t) 


2) Perform interpolation so that an upper envelope, Xp (t) , and a lower envelop, 
bk ) , can be formed by all of the local maxima, and local minima respectively. 


3) Obtain the mean, ™m(f) , of the upper, and lower envelopes using 


en Si r 


4) Extract the detail h,(t) = x(t)—m(C), the index 7 is the number of the relevant 


IMF. If h,(t) could satisfy the two IMF conditions (that is, conditions i, and ii as 


defined before), then it is the valid IMF. Otherwise, it is not a valid IMF; iterate the 
above steps until a new IMF, which satisfies the two conditions, is found. 


5) Define the residue 7, (t) = x(t) —h,(t), the index n is the number of the relevant 
IMF As Ff, (t) still contains much information from the lowest frequency; replace 


x(t) with r,(t), and repeat the sifting process until the amplitude of the residue is 
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lower than a predetermined threshold, or it contains the lowest frequency component 
of the signal x(t) .The original signal can be precisely reconstructed using 


x(t) = > h(t)+r,(t) (2) 


2.2 SVM 


SVM is developed from the optimal separation plane under linearly separable condi- 
tion. SVM uses a hypothesis space of linear functions in a high dimensional feature 
space to estimate decision surfaces directly rather than modeling a probability distri- 
bution across training data.[5-6] It uses support vector kernel to map the data from 
input space to a high dimensional feature space which facilitates the problem to be 
processed in linear form. SVM always finds a global minimum because it usually tries 
to minimize a bound on the structural risk, rather than the empirical risk. The struc- 
tural risks, defined as a structure derived from the inner class of the function in the 
nested subset, find the subset of the function that minimizes the bound on the actual 
risk. SVM achieves this goal by minimizing the following Lagrangian formulation: 


1 i 1 
L=—llalt -) ay,(x,eo@+b)+ >a, (3) 
i=l i=l 


Where @, is positive Lagrange multiplier. SVM uses some kernels to map the data 


from the input space to a high dimensional feature space which facilitates the prob- 
lem to be processed in linear form. In this paper linear, radial basis function (RBF), 
quadratic and polynomial kernels have been used. 


3 Fault Diagnosis Combined EMD and SVM 


While the ball bearing with different faults is operating, the corresponding resonance 
frequency components are produced in the vibration signals, and here the energy of 
fault vibration signal changes with the frequency distribution. To illustrate this change 
case as mentioned above, the energy entropy mean concept is proposed here. 


If n IMFs and a residue 7, (ft) are obtained by using the EMD method to decom- 
pose the ball bearing vibration signal x(t) where the energy of the n IMFs is El, 


E2,...,En, respectively; then, due to the orthogonality of the EMD decomposition, the 
sum of the energy of the n IMFs should be equal to the total energy of the original 


signal when the residue 7,(¢) is ignored. As the IMFs h,(t) include different fre- 


quency components, E={ El, E2,....En }, forms an energy distribution in the 
frequency domain of ball bearing vibration signals, and the corresponding EMD 
energy entropy is 


H yy =->_ p,log p, (4) 


i=l 
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Where p, = Ei/ E is the percent of the energy of H,(t) in the whole signal energy, 


and E = > E, .Then energy entropy mean concept is as the mean of energy entropy 
i=l 
of vibration signals collect from a faulty ball bearing in various speeds and loads. 
Therefore, IMFs’ energy entropy mean could be used as fault feature vectors. After a 
fault feature vector has been extracted, SVM could be chosen as classifier to identify 
the work condition and fault pattern of bearing. 
The EMD and SVM method for fault diagnosis can be summarized as follows: 


1) Sample N times at a certain sample frequency fs under the condition that the 
bearing is normal, ball fault, inner-race fault, out-race fault respectively. And the 
4N signals are taken as samples that are divided into two subsets, the training 
samples and testing samples. 

2) Each sample signal is decomposed by EMD method and calculated energy entro- 
py mean for each IMFs to construct one initial feature vector. 

3) Each sample’s motor speed(RPM) is normalized to construct another initial fea- 
ture vector. 


Design SVM classifier. The energy entropy mean and normalization RPM of the 
initial feature vector of the training samples are used as the fault feature vectors to be 
input to the SVM classifiers and the classifiers are trained. 


4 Experimental Verification 


In order to verify the efficiency of our method, we adopt fault test data from the Case 
Western Reserve University Bearing Data Center. Experiments were conducted using 
a 2 hp Reliance Electric motor, and acceleration data was measured at locations near 
to and remote from the motor bearings. Motor bearings were seeded with faults using 
electro-discharge machining (EDM). Faults ranging from 0.007 inches in diameter to 
0.040 inches in diameter were introduced separately at the inner raceway, rolling 
element (i.e. ball) and outer raceway. Faulted bearings were reinstalled into the test 
motor and vibration data was recorded for motor loads of 0 to 3 horsepower (motor 
speeds of 1797 to 1720 RPM). 

SVM classifiers are needed to design if four classes of bearing conditions are to be 
identified, like normal, with ball fault, with inner-race fault and with out-race fault. 
First of all, for SVM1, define the condition with normal as y =+1 and the other condi- 
tion as y = —1, thus the normal condition could be separated from other condition by 
SVM1. Then define the condition with ball fault as y =+1 and the other condition as y 
= —1 for SVM2, thus the ball fault could be separated from other condition by 
SVM72.Then define the condition with inner-race fault as y =+1 and the other condi- 
tion as y = —1 for SVM3, thus the inner-race fault could be separated from other 
condition by SVM3. Since we have known there are only three conditions to be iden- 
tified, the remaining should be out-race fault. The identification approach is the same 
as above, that is, extract 11 samples as training ones at random (3 samples with nor- 
mal condition, 4 samples with inner-race fault and 4 samples with out-race fault). The 
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Table 1. The identification results of proposed method for roller bearing 


Fault feature vector Identifica- 
Testing ar SVM1 SVM2 SVM3 tion 
Samples Puy DonnalZanon Distance 1 | Distance | Distance Results 
Entropy | RPM 2 3 

Normal 0.52845 0.5(1797) | 0.7120 Normal 
(+) 

Normal 0.48377 -0.5(1772) | 0.5469 Normal 
(+) 

Ball 0.43559 0.5(1797) -0.8321 0.6492 Ball 
(-1) (+1) 

Ball 0.40264 -0.5(1772) | -0.9762 0.7941 Ball 
(-D) (+) 

Inner- 0.49733 0.5(1797) | -0.5211 -0.8947 0.9213 Inner-race 

race (-1) (-1) (+1) 

Inner- 0.45937 -0.5(1772) | -0.498(-1) | -0.5876 1.0224 Inner-race 

race (-1) (+1) 

Out-race | 0.55867 0.5(1797) | -0.9561 -0.6412 -1.2246 | Out-race 
(-l) (-l) (-D 

Out-race | 0.51346 -0.5(1772) | -0.7476 -0.8454 -0.8298 | Out-race 
(-1) (-1) (-D) 


part identification results are shown in Table | from which we can see that two SVM 
classifiers can identify the working conditions and fault patterns of bearing 
accurately. 


5 Conclusions 


In this paper, original vibration signals based on EMD are preprocessed, IMF energy 
entropy mean are calculated. The energy entropy mean are extracted as training and 
testing samples of SVM. The final result shows that the proposed method of fault 
diagnosis for bearing based on EMD and SVM algorithms if effective. 
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Abstract. For the multisensor systems with unknown model parameters and 
noise variances, based on the system identification algorithm and correlation 
method, the estimators of model parameters and noise variances can be obtained. 
Based on the information matrix, a self-tuning centralized fusion Wiener filter is 
presented by substituting the estimators into the corresponding optimal filter. 
Using the dynamic error system analysis (DESA) method, it is proved that the 
self-tuning centralized fusion Wiener filter has asymptotic global optimality, i.e. 
it converges to the optimal centralized fusion Wiener filter. A simulation exam- 
ple applied to signal processing shows its effectiveness. 


Keywords: Self-tuning, centralized fusion, Wiener filter, information matrix, 
convergence. 


1 Introduction 


No doubt that the multisensor information fusion has become one of the most popular 
fields for its widely application [1]. The conventional information fusion methods 
based on Kalman filtering include the centralized and distributed fusion methods [2]. 
The former gives the globally optimal state estimation by combining all local mea- 
surement data, with the disadvantage of requiring a larger computational burden. The 
latter gives the globally optimal and suboptimal state estimations by combining or 
weighting local state estimators [3], respectively. It can facilitate fault detection and 
isolation more conveniently, and reduce the computational burden. The centralized 
fusion method based on information matrix presented in [4] can give globally optimal 
estimation and avoid calculating the high-dimension inverse matrix. 

In this paper, for the multisensor systems with unknown model parameters and noise 
variances, based on the information matrix approach, a self-tuning centralized fusion 
Wiener filter is presented. It makes up the shortcoming that the existing literatures of 
information filter are seldom and just for the system with unknown noise statistic [4,5]. 
It is strictly proved that the self-tuning fuser presented in this paper has asymptotical 
global optimality by the DESA method. 
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2 Problem Formulation 


Consider the linear discrete time-invariant stochastic system with L-sensor 


x(t+1)=@(A)x(t)+ Tw(t) . (1) 
y,(t) =H, x(t) +v,(t) ,§=1,2,---,L . (2) 


where f¢ is the discrete time, x(t)e R", y,(t)e RR”, w(t)e R’ and v,(t)e R™ are the 
state, measurement, input noise and measurement noise, respectively. ®(@), I and 


H, are constant matrices with compatible dimensions. The vector @€ R* denotes the 


unknown constant parameters in the state transition matrix ® , and each unknown 
element of ® is a continuous function with respect to 6. When @ is known, we de- 
note D(A)=O@. 


Assumption 1. (®,H;)is a completely controllable pair, (®,7) is a completely ob- 
servable pair, and ®@ is a nonsingular matrix. 


Assumption 2. w(t)e R’ and v,(t)e R” are uncorrelated white noises with zero means 
and variances Q>0 and R,>0. 


Assumption 3. The matrices H and I are known, vector @ and the noise variances 
Q and R, are unknown. 


Assumption 4. The measurement data y,(t) are bounded, i.e. a realization of the sto- 


chastic process y,(t) is bounded, i=1,2,---,L. 


The problem is to find the self-tuning centralize fusion Wiener filter x*(t|1) of the 


state x(t) based on the information matrix. 


3 Globally Optimal Centralized Fusion Wiener Filter 
From (2), the centralized fusion measurement equation is given by 
y(t) = Ax(t)+v(t) . (3) 
where 
wo=bio - yo! aelat - atl wosbto - viol’. (4) 
and the fused measurement noise v(t) has the variance as 
R=diag[R, -- R,]. (5) 


When @, Q and R, are known, from (1) and (3), the globally optimal centralized 
fusion Kalman filter is given by 


Self-tuning Centralized Fusion Wiener Filter with Applied to Signal Processing 431 


Mtlo =" (Hnxt-1lt-l)+ Ky) . (6) 
Y= PtlNs '(¢tlt-DO®, K()=PCtlNH'R"™ . (7) 
Pldin=Z¢lt-D)+HA'R'A ,ZStt+ll)=OP(tlNo'+7or' . (8) 


where inverse matrices P'(t|t) and Y'(tlt-1) are called information matrix. 


Theorem 1. For the multisensor system (1) and (2) with Assumption | and 2, the 
globally optimal centralized fusion Wiener filter is given by 


Wig REI) =M(q')Y HER y() « (9) 


where 
w(q')=det(I, —g' (1), M(q') = adj, —q° ¥O)P(tIn) (10) 
W(t), Plt It), X(t+114f) are given by (7) and (8), and P(t It) can also be given 


L 
by a new form Pldin=zr (tl t-l)+) A) RH, , which can reduce the computa- 
i=l 
tional burden because compared with the standard Kalman filter, it can avoid the 
computation of high-dimension inverse matrix. 


L 
Proof. From (3) - (7), R11) =P (HRO- Mt — + POD) HER; y, (1) is obtained, 


i=l 

LE 
which can be transformed into (J, — q W(t) X(t It)=P(tl D>, ye R y,(t) . It has been 

i=l 
proved that Y(t) is asymptotically stable [6], so 
(,-q'))' =adj(I, —g (0) / det, —g"¥(t) . Thus (9) and (10) hold. From 
(8) and applying (4) and (5), the new form of Pp (tt) can be obtained. This completes 
the proof. 


4 Self-tuning Centralized Fusion Wiener Filter 


When the model parameters and the noise variances are unknown, the self-tuning 
centralized fusion Wiener filter can be realized by the following steps: 

Step 1. Applying the system identification algorithm [7] (for example, RIV algorithm 
or RELS algorithm), the fused estimator A(t) of @ is obtained by taking the average of 
all local estimators. 
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Step 2. Applying the estimator A(t) and the correlation function method [8], the fused 


estimators O(t),R, (t) of the noise variances Q,R, are obtained in the similar way. 


Step 3. Substituting all the fused estimators into Theorem 1 yields the self-tuning 
centralized fusion Wiener filter as 


Wg ')k'(t11) = wry HR (t)y,(t) . (11) 

(q') = det, —q' (1) ,M(q"') = adj, —q YO) PCIt) . (12) 
P(t) = Pr NE "(tlt-NO(Q)) . (13) 

Ply = ECU) + HIRO, (14) 

Lr 1) =O) P(t H®™()+TOMI . (15) 


The above three steps are repeated at each time t. 


5 The Convergence Analysis 


Assumption 5. The parameter estimator A(t) and the noise variance estimators O(t) 


and R (t) are consistent, i.e. 
A(t) > 0,®((t)) > ®(6),O(1) 9 O,R,(t) 9 R,, as t30,w.p.l. (16) 


where the notation “w.p.1” denotes “with probability one” [7]. 


Theorem 2. For the multisensor system (1) and (2) with Assumption 1-5, the 
self-tuning centralized fusion Wiener filter (11) converges to the optimal centralized 
fusion Wiener filter (9) in a realization, i.e. 


[f° (t1t)-£(t11)] 30, as to, iLar.. (17) 


Proof. When Assumption 1-5 are satisfied, for the globally optimal centralized fusion 
Kalman filter (6) - (8), by applying the dynamic variance error system analysis 


(DVESA) method [9], it has been proved [4] that [S(¢+114)-E] >0 , 
[Pitlt)-P] 30, [K()-K]30, [¥()-¥] 30, as 15, iar, where X, P, 
K and © are the corresponding values in the stable Kalman filter and ‘¥ is a stable 
matrix, so K(t),(t) are bounded. Then from (10) and (12), it holds that 


Wq') > W(q'),M(q') > M(q"), as to, iar. 
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Setting W(q')=w(q')+AW(q"') , M(q!)=M(q')+AM(q"') then we have 
Aw(q')> 0, AM (q')>0, as to, iar.. Defining d(t) =[(¢11)- 210), 
subtracting (9) from (11) yields the dynamic error system 


W(q )d(t) =u(t) . (18) 


u(t)=M(q"' E> HT (R'()-R')y, QF AM (q)> APR Oy, O-AWG DECI ~~ — (19) 


= i=l 


L 
Assumption 4 and 5 yield that SAR (t)y,(t) is bounded. Then from (11), it can 


i=l 
yield that x°(¢1t) is bounded [10]. Noting that W(q') is a stable polynomial, which 
can be obtained from (10) and the stability of ‘¥. And it yields (R'(t)—R,') >0 


L 
from Assumption 5, i.e. M(q')[)H/ (R-'(t)—R;')y,(t)] 3 0 . So it can be obtained that 
i=l 
u(t) > 0, as ft, iiasr.. Thus from (18), it [10] yields d(t) 30, as too, iar., 
i.e. (17) holds. This completes the proof. 


6 Application to Signal Processing 


Consider the multisensor single channel Autoregressive (AR) signal with L-sensor 


A(q7')s(t)=w(t-1) . (20) 


y(t) = s(t) +v,(t),i=1,2,5L (21) 


where s(f) is the AR signal to be estimated, y,(r) is the measurements of the ith 
sensor, w(t) and v,(t) are independent white noises with zero mean and variances Q 


-l 


and R,, respectively. A(q™') is a stable polynomial of g™' with the following form: 


A(q')=l+aq't--a,q". 
The AR signal model denoted by (20) and (21) can be transformed into the state 
space model denoted by (1) and (2), where @,/°, H, are given by 


= 
= TI 0 

o6)=| See (Tee a Se SIE On gai oO) (22) 
-a, 0 0 0 


where ® contains the unknown parameter vector @=[a,,a,,---,a,]’ , I, denotes the 


nXn unite matrix. Thus, when a,,a,,---,a,, Q and R,; are unknown, the problem of 
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obtaining the self-tuning fusion Wiener signal filter s*(t|t) can be converted into the 


one of obtaining x°(t1t) because of the relationship 
s(t)=Hx(t) . (23) 


Substituting (20) into (21) yields the least squares structure y,(t) =) (O+ r(t), 


with definition @(t)=[-y,(@¢-),--.-y,@@-m! and) ,(t) = w(t-1)+v,(1) + 
ayv,(t—I)+-+-+a,v,(t—n). Based on the ith sensor, we can obtain the local RIV 


[7,11] estimates 6(t) of @, and it has been proved that the RIV estimate of the AR 


, 1G. 
parameters is strongly consistent. Then the fused estimate A(t) =—>6(1) is also 


i=l 
strongly consistent, i.e. A(t) 90, as to, w.p.1. 
From (20), (21) and the definition of 7,(t) , we have 


r(t)= Aq ')y,(t) = w(t-1)+ A v0) (24) 


Define the correlation function of r,(¢t) as R,,,(k)= Ely, (t)r, (t—k)]. Computing the 


rij 


correlation function of (24) yields 


R,, 0) =O+ Da, R6, » Ry (K) = 9) gy 4 R6y i JH), L,kaLe in (25) 
a=k 


a=0 : 
where 6,=1,6,=0 . Then the estimator of x(t) can be obtained by 


r.(t)= A(q") y,(t). Further, the estimator a (k) of the sampled correlation function 


(kK) =1D F(a? (a-b) , and it has been proved [8] that 
t 


a=l 


is defined as R' 


rij 


R(k) OR 


- (kK), aS t > oe, 1.a.r.. Substituting the estimates R (k) and A(q"') into 


rij 
(25), the estimator of R, can be obtained by 

: Ri (k Z nn 
R,()=— nil) : RQ) =-Y Re i=l Lk ee 
D4, (4, (0) i 

a=k 


(26) 


where RB (t) and R (t) are the local and fused estimators of R,, respectively. Simi- 
larly, the local and fused estimators of Q are obtained by 


8,1) = Ry O)- LAOR (05, OO =z YOO EJ beL (27) 


Applying the method in [4], it can be easily proved that R (tho R.,O(t) —>Q, as 
t > co, w.p.1. So Assumption 5 holds. 
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Substituting the estimators into the optimal centralized fusion Wiener filter, the 
self-tuning centralized fusion Wiener filter x°(t|t) is obtained. Hence from (23), the 
self-tuning fusion Wiener signal filter can be given by 


(tl) = AK (It). (28) 


7 Simulation Example 


Consider the single channel Autoregressive (AR) signal with L-sensor 


(+aq™' +a,q~)s(t)=w(t-]) . (29) 


y,(t) = s(t) +v,(t),i=1,2,3 . (30) 


where s(f) is the AR signal to be estimated, y,(r) is the measurements of the 
ith sensor, w(t) and v,(t) are independent white noises with zero mean and variances 
Q and RK; , respectively. In simulation we take a, =0.4,a,=—0.45,Q0=1 , 
R, =0.1,R, =0.3,R, =0.5. 

When the model parameters and noise variances are unknown, the fused estimators 
of the model parameters and noise variances are shown in Fig. 1-Fig.3, where the curves 
and straight lines denote the estimates and real values, respectively. The error curves 


between the self-tuning and optimal fused Wiener signal filters are presented in Fig.4, 
which show the convergence of the self-tuning fusion Wiener signal filter. 


RO 


“Oo 2000 4000 6000 8000 ~=o 2000 4000 6000 8000 


t/step t/step 
Fig. 1. Fused estimators of a,,j=,2 Fig. 2. Fused estimators of R,,i =1,2,3 
1.5 0.5 
Q(t) 
1 oO 
0.5 -0.5 
oO 2000 4000 6000 8000 Oo 2000 4000 6000 8000 
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Fig. 3. Fused estimator of Q Fig. 4. Error curves e(t) = s°(t|t)—s(t|t) 
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8 Conclusion 


For the multisensor system with unknown model parameters and noise variances, the 
information fusion estimators are obtained by the system identification algorithm and 
the correlation method. Based on the fused estimators and information matrices, a new 
self-tuning fusion Wiener filter is presented. Further, it is rigorously proved that the 
self-tuning centralized fusion Wiener filter converges to the optimal centralized fusion 
Wiener filter in a realization by the DESA method, so it has asymptotical optimality. 
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Abstract. By the covariance intersection (CI) fusion method, the covariance in- 
tersection fusion steady-state Kalman filter is presented for two-sensor system 
with unknown cross-covariances between local filter errors. It is proved that its 
accuracy is higher than that of each local filtering, and is lower than that of the 
optimal fuser with known cross-covariances. A Monte-Carlo simulation result 
shows that its accuracy is approximates to that of the optimal fuser. 


Keywords: Information fusion Kalman filter, steady-state Kalman filter, 
covariance intersection fusion, unknown cross-covariances, covariance ellipse. 


1 Introduction 


Multisensor information fusion Kalman filtering has been applied to many fields, 
such as guidance, defence, robotics, tracking, signal processing, GPS positioning. To 
compute the optimal distributed weighted fusion Kalman filter [1,2] requires that the 
cross-covariances among local Kalman filtering errors are known exactly. However, 
in many practical applications, the cross-covariances are unknown or computation of 
the cross-covariances is very complex, or to find the cross-covariances is very diffi- 
cult. In order to overcome these drawbacks, a covariance intersection (CI) fusion 
method was presented [3,4], which can handle the fusion problem with unknown 
cross-covariances. This paper, using the CI fusion method, a CI fusion steady-state 
Kalman filter is presented for two-sensor system with unknown cross-covariances. It 
is rigorously proved that the accuracy of CI fuser is higher than that of each local 
filter, and is lower than that of the optimal fuser with known cross-covariances, and 
the CI fuser is consistent, i.e. the actual variance for CI fuser has the theoretical upper 
bound obtained from the CI fuser. 


2 The CI Kalman Fuser 


Consider two-sensor system 


x(t+1) = Ox(t)+ Twit) () 
y= A,xO)+v,(t), = 1,2 (2) 
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where x(t)e R" is the state, y,(t)e R” is the measurement, w(t)e R’ and 


v,(t)e€ R”™ are uncorrelated white noises with zero mean and variances Q, and Q,,, 


respectively. 
The local steady-state Kalman filters are given by [5] 


K,(t1t)=¥,5,¢-Lt-D+K,y,@, 1=1,2 (3) 
PF, =U, -K,H1® (4) 
K, = 2H; [H2)H) +, 1" (5) 


where the symbol T denotes the transpose, &, satisfies the Riccati equation and the 


steady-state filtering error variance P. is given by 


i 


P=U,-K,H12; i=1,2 (6) 
the cross-covariance F, between the local filtering errors satisfies the Lyapunov 
equation 

Py = Le che oe +A, (7) 
A, =1,-K,H,WOl"U,-K,,H,7 (8) 


when P and P, are known, the optimal fuser weighted by matrices is given [1] 


Xj (t1t) = Qi (t1H)+ 2x, (eld) (9) 
with the optimal weighted matrices [1,2] 

Q=(P- PR +P -F,- Py)! (10) 

Q=(R FMA +P Py Py)" (1) 


with P., = P,,. The fused error variance matrix P, is given by 
R=h-(A-Pj A+R -P,- Py) (A -B,) (12) 


When P and P, are known, but the cross-covariance P, is unknown, the CI Kalman 


fuser without F, is given by [3,4] 


£, (tlt) = P, [oP Ss, (t1)+(-@)P,'8,(¢10)] (13) 


P,, =[@P'+(1-@)P'J' (14) 


Covariance Intersection Fusion Kalman Filter 439 


where We [0,1], and minimizes the performance index 
min trF, = min tr{ [oP'+(1-@)P,'T"} (15) 


where the notation tr denotes the trace of matrix. The optimal weighting coefficient 
@ can fast be obtained by the gold section method or Fibonacci method [6]. 


Theorem 1. The local and fused Kalman filters have the accuracy relation 
trP,, <trP., <trP i=1,2 (16) 


where the theoretical variance P., is determined by (15), is the actual cross- 
t 


Pe, 
covariance [3,4] for CI fuser X,,(t|f) given by (13) without P,. 
Proof. From (15) we have that taking @=0 yields trP., = trP,, taking w@=1 yields 
trP., = trP. Hence from (15) we have trP., < trP for we [0,1], i=1,2. The relation 


P., <P, was proved in [3,4], which yields trP., < trP.,. This shows consistency of 
CI fuser. 


Remark 1. Theorem | shows that the actual accuracy of CI fuser is higher than that of 
each local filter. 

In order to give a powerful geometric interpretation with respect to accuracy rela- 
tions of local and fused filters, the covariance ellipse for a covariance matrix P is 


defined as the locus of points {x: x’ P™'x=c} where c is a constant. In the sequel, 
c=1 will be assumed without loss of generality. The following facts were proved: 
P< P, is equivalent to that the covariance ellipse for P is enclosed in the covariance 
P, and P, lies 


within the intersection of ellipses for A and P,. The ellipse for CI fused covariance 


ellipse for p,. The ellipse for fused covariance P, with known P, 
P., with unknown P, encloses the intersection region of ellipses for P and P,, and 
passes through the four points of intersection of ellipses for P and P,. These facts 
are shown in Fig.1. 


Theorem 2. The local and fused Kalman filters have the accuracy relations 


P<P, tP, strP, i=1,2 (17) 
As Fy, tr, StrP, (18) 


Proof. The analytic proof was presented in [1]. The geometric proof is show in Fig.1. 
Since the ellipse for P, is enclosed in the intersection of ellipses for F and P,, then 


it is enclosed in the ellipse for P , which yields P, < P,, and it is also enclosed in the 
ellipse for P,, which yields P, < P,. Hence (17) holds. Since the ellipse for P, is 


enclosed in the ellipse for P.,, 


which yields (18). The proof is completed. 
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a 0.8 -06 -04 -0.2 0 02 04 O06 08 1 


Fig. 1. The accuracy comparison of P, P,, PR, and P,, by covariance ellipses 


Remark 2. Theorem 2 shows that the accuracy of the optimal fuser is higher than that 
of local filters or CI fuser. 


3 Simulation Example 


Consider the two-sensor tracking system (1) and (2) with 


‘ae #: _| 0.57, 
see ce e 
A =[1 0) Ae= oe 2 
cs Balt g (20) 


where x(t) = [x, (t) x, (| is the state, x,(t) and x,(t¢) are the position and velocity 
of target, TJ, is the sampled period, w(t), v,(t) and v,(t) are white Gauss noises with 
=0.5, 


2 
vl? 


zero mean and variances o-, O,, Q,, , respectively. In simulation we take T, 


0 
2 


0. =2, 0, =1, Q,, =diag(16,0.25), t=1,---,300. 

In order to verify the above theoretical results for accuracy relation, 200 Monte- 
Carlo runs are performed for t =1,---,300, the MSE curves of local and fused Kal- 
man filters are computed, where the MSE value at time f is defined as the sampled 
average for trP = trE[(X, (t 11) — x())(%, (41) —- x(t))"], (E denotes the expectation), i.e. 
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Sr In x) RO (t1N— x (D) i= 0,1,2, CF. (21) 


jal 


1 
MSE, (t) = — 
N 
where N =1,---,200, r=1,---,300, fj=1,2, #/ (tt) or x“ (1) denote the jth reali- 
zation of x,(t|t) or x(t), respectively. According to ergodicity 

MSE, (t) > trP, as N>~,t > ,i=0,1,2,Cl (22) 


The simulation results are shown in Fig.2. 


20 40 60 80 100 120 140 160 180 200 220 240 260 280 300 


—©— MSE1 (t) —®-—MSE2(t) —4&—MSEO(t) ——MSECI (t) 


Fig. 2. The comparison of MSE, (t) and trP , i=0,1,2,C7. 


where the straight lines denote trP , i=0,1,2,CI , the curves denote the correspond- 
ing MSE, (t). We see that the values of MSE, (tf), (= 0,1,2) are close to the corres- 
ponding trP , which verifies the ergodicity (22). We also see that MSE_,(t) < trP.,. 
According the ergodicity, MSE,, (t) > trP., , hence trP., < trP.,. This verifies P., is 
a upper bound of Poe: Finally, we see the CI fuser has good performance, whose 
accuracy is approximates to the accuracy of the optimal filter, because MSE_,(f) is 


approximates to MSE, (f). 


4 Conclusion 


A CI fusion steady-state Kalman filter has been presented for systems with unknown 
cross-covariances, which has advantage that the computation of cross-covariances can 
be avoided. It is proved that its accuracy is higher than that of each local filter, and is 
lower than that of the optimal fuser with known cross-covariances. Simulation results 
show that its accuracy is approximates to the accuracy of the optimal fuser. 
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Abstract. For the multisensor systems with unknown model parameters and 
noise variances, based on the system identification method and correlation me- 
thod, the online information fusion estimators of model parameters and noise 
variances can be obtained. Substituting them into the optimal fused Kalman 
smoother weighted by scalars for components, a self-tuning fusion Kalman 
smoother weighted by scalars for components is presented.The proposed self- 
tuning Kalman smoother converges to the time-varying optimal fusion Kalman 
smoother in a realization, so that it has asymptotic optimality. It can be applied 
to self-tuning signal processing. A simulation example shows its effectiveness. 


Keywords: multisensor information fusion,decoupled information, identificati- 
cation, self-tuning Kalman smoother. 


1 Introduction 


With the high-accuracy requirement on target tracking and signal estimation in many 
high-technology fields, the multisensor information fusion has received great atten- 
tion in recent year[1].The optimal fusion Kalman smoother by [2] requires the model 
parameters and noise statistics are known. This restricts their practical applications. 
And the filtering for the systems with unknown model parameters and/or noise va- 
riances is called self-tuning filtering [3]. Several self-tuning weighted fusion filters 
were presented only for systems with unknown noise variances [4-7]. Their draw- 
backs are that only the noise variances are assumed to be unknown, while the model 
parameters are assumed to be known. 

In this paper, using the classical Kalman filtering method, the self-tuning inform- 
ation fusion Kalman smoothers weighted by scalars for components is presented for 
the multisensor systems with unknown model parameters and noise variances. The 
proposed self-tuning information fusion Kalman smoother converges to the optimal 
information fusion Kalman smoother in a realization, so it has asymptotic optimality. 
In [8], a self-tuning information fusion Kalman smoother is presented by using the 
modern time series analysis method. Therefore, compared with [8], the on-line identi- 
fication of the ARMA innovation model is avoided. 
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2 Optimal Fusion Kalman Smoother Weighted by Scalars for 
Components 


Consider the multisensor linear discrete-time stochastic system 


x(t +1) = &(8)x(t) + Iw(t) (1) 


y,(t) = H,x(t)+v,(), F=1,2,5L (2) 


where f is the discrete time, x(t)e R", y,()e R”™ , w(t)e R" and v,(t)e R™ are the 
state, measurement output, the input noise and measurement noise of the ith sensor 
subsystem, respectively. The transition matrix ® contains a unknown parameter 
vector Oe R*, and each element of @ is a continuous function with respect to @. 
When @ is known, we denote (0)=@. 


Assumption 1. w(t) and v,(t) are the uncorrelated white noises with zero means and 


variances Q>0 and R, >0, i.e. 


wt) | 7 7 _|@ 0 
e( ‘| [wi(k) v; w}-{f RO, Sx (3) 


where 6, is the Kronecker delta function, i.e. 6, =1, 6, =O(¢#k), and the super- 
script T denotes the transpose. 


Assumption 2. When @ is unknown, each unknown element of @ is a continuous 
function with respect to @. When @ is known, @ is non-singular, (®,H;)is a com- 


pletely observable pair, (®,/°) is a completely controllable pair. 


Assumption 3. The matrices H, and I are known, but the parameter vector 0, and the 
noise variances Q and R,(i=1,---,L) are completely or partially unknown. 


Assumption 4. The measurement data y,() (a realization of measurement stochastic 
process y,(t) ) are bounded for ¢, i=1,2,---,L. 


Denoting the state x(t) in the component form as 


xpN=[4O, x, (0) (4) 


The problem is to find the self-tuning information fusion Kalman smoother weighted 
by scalars. 


Lemma 1 [9]. For the multi-sensor system (1) and (2) with known model parameters 
and noise variances, the ith sensor subsystem has the local optimal Kalman predictor 
X,(t+11t) of x(t) as 


4,(t+ 111) = %,()4,(t1t-) + Kp Oy, (5) 


Y(t) =D — Kp, (tH; (6) 
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K p(t) = ®3,(t tH; (A;3,(tlt-)H; +R,)' (7) 
where the prediction error variance matrix satisfy the optimal Riccati equations 


Z(t+ll) = OLY, (tlt-Y)-3,(¢1¢t-DA} (4,2, lt -DH} +R)" 


(8) 
XH, X,(tlt-Djo' +7or" 
and the local prediction cross-covariances satisfy the Lyapunov equation 
FE, (t+1 lt) =", (OZ, C1t-D¥{O+TOM" ,i# ji, f=12--L (9) 


Lemma 2 [9]. For the multi-sensor system (1) and (2) with assumptions | and 2, the 
ith sensor subsystem has the local optimal time-varying Kalman smoother as 


N 
$0 N11) =3,0-NU-N-D+ ) K,G- NUN + pe t-N+ pis besL (10) 
j=0 


where the optimal Kalman predictor x;(t— N|lt—N-—1)can be computed by (5), and 
we can obtain 


E(t) = y,(1)- A,X, (t1t-1) (11) 


jol 
K (let N= 2 Clt— DU] [Mi + OMT HZ C+ let j-DHP +R" - 
k=0 


K,(t\t)=5,(0¢lt-)A} (A, 2,(t1t-DH; +R)" 


The error variance matrices and covariance matrices of local optimal Kalman smooth- 
er are given as 


N 
P(t-N\t)=5,(t-N|t-N-1) EG NiIt-N+j) 
70 (13) 


x(H,2,(t-N+jlt-N—-1+ /)H) +R,)K/ (t-NIt-N+j)) 


N 
Pj(t-NIt)=Y%y(t-N)Z,(¢-NIt-N-D¥ y(t N) +) Kp@-N)OKY t-N) 4) 
p=0 


i# j,i, fj =1,2,---,L 


with the definition P,(t-NIt)=P(t-N It). 


N 
Vxut-N)=1, VK NIt-N+k)H¥,(t-N+k,t—-N), Kiy(t-N)=0 
k=0 
(15) 


(t-N+k,t-N+p+bI, 


N 
KR¢-N)=- )K,t-NIt-N+ DAY, 


k=p+1 
p=0,---,N-1 
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Lemma 3 [9]. For the multi-sensor systems (1) and (2) with assumptions | and 2, we 
have the optimal information fusion Kalman smoother weighted by scalars for 
components 


G 
X(t-NIn= ) QA, Hx (t-NIt) 
; ye (16) 


Q(t) = diag(@;,(0),--+, Oj, (0) 
where the optimal scalar weighting coefficient vectors @,(t) are given by 
a(t)=[a,0), —-, @,O)]=le(PXG-N Ip) tele (PM(t-NIDy i=lesn (17) 
where e! =[l,---,1], and the LxL matrix Py (t- N|t) is defined as 
P'¢-NID)=(3C-NID), FHL L (18) 


whose (k, j) element a (t— N It) are the (i,i) diagonal elements of Pi(t-N It). 


3 Self-tuning Fusion Kalman Smoother Weighted by Scalars for 
Components 


When model parameters and noise variances are unknown, substituting their estima- 
tors into the optimal fusion Kalman smoother will yield the self-tuning fusion Kalman 
smoother. It consists of the following steps: 


Step 1. Applying the system identification algorithm[10](for example, the recursive 
instrumental variable (RIV) algorithm, the recursive extended least square (RELS) 
algorithm), and the correlation method [11], the information fusion estimators 


6(t) O(t) and R(t) are that for the model parameters @ and noise variances 
Q,R,(i=1,---,L) can be obtained [12]. 


Step 2. In Lemma 1, ®,Qand R&,(i=1,---,L) are replaced by @(A(t)) ; O(t) and R, (t), 


repectively. So the estimates ¥ 


pi (t) and K..(t) can be obtained. And the estimates 


pi 


3,(t|t—1) satisfy the self-tuning Riccati equations 


S(t +111) = OM) (S, (tlt -) -— 3,0 1t -DH} (A, 2, 1t-H} + Ass 
RO) 1H, 3, 1t- DIG" (OO) + OWL" 


and the prediction cross-covariance matrices satisfy the self-tuning Lyapunov 
equations 


Zi (t+ ln =%p, 3; (t1t-D¥y (Ot LOOM i# j,i, 7=120L (20) 
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Substituting the estimates into (5), self-tuning local Kalman predictor can be given as 
+11) = % (D5) C1 t- 1) + Kp (y,() (21) 
Substituting the estimates into (10)-(15), yields the self-tuning local Kalman smoother 
ul A 
Xk? (t-N1t)=x;(t-N | t-N-1)+)) R\G-N It-N+ Pé(t-N+j)i=1-,L (22) 
j=0 
And, the estimates é, (t), K; (t-NIt-N+ D)PU-N | 1), B, (t—N\t) can be obtained 
from Lemma 2. 


Step 3. Applying (18) and (19), the estimates P(t-N\t) and Q, (t) are obtained, and 
the self-tuning fused Kalman smoother is given as 


L 
S0-N I=) QR O-N I) (23) 


j=l 
The above three steps are repeated at each time t. 


Assumption 5. The parameters and noise variance estimators A(t) ; O(t) and 


ROG =1,---,L) are consistent, i.e. 


A(t) > 8, O(t) >Q, R(t) >R,, aS t3%, Lar (24) 


where the notation “‘i.a.r’” denotes “in a realization” . 
Applying to the dynamic variance error system analysis (DVESA) method [13, 


14], the solution be (t+1lt) of the self-tuning Lyapunov equation (20) converges to 


the solution ,,(¢+11t) of the time-varying optimal Lyapunov equation (9), i.e. 
[2,(t+11)-3,(¢+ 11] 90, as te, iar (25) 


Then, it can be proved that local self-tuning Kalman predictor x/(t+11r) converges 


to the local optimal Kalman predictor x,(t+11t) by the the dynamic error system 
analysis (DESA) method [4],i.e. 


[4)(t+111t)-%,(¢+11H] 30, as to, Lar. (26) 


From (25) and (26), it can be obtained easily that the local self-tuning Kalman 
smoother *;(t—NIt) converges to the local optimal Kalman smoother %,(t—NI1), 
and the self-tuning fused Kalman smoother x(¢— Nt) converges to the optimal fused 


Kalman smoother x,(¢—N It) in a realization, i.e. 


E(t-NIt)-%,(t-NIf) 90. as ft, Lar. (27) 


Xg(t-NIt) > X)(t-NIt), as too, La.r. (28) 
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4 Simulation Example 


Consider the multisensor single channel Autoregressive (AR) signal with white 
measurement noise 


A(q7')s(t) = w(t-D) (29) 
y,(t) = s(t) +, (t),§ =1,2,3 (30) 
A(q"') =It+aq"' +aq° +aq° (31) 


where y,(t)e R is the measurement of the ith sensor, v,(t) , w(t) are independent 


Gaussian white noises with zero mean and variance 02,07 


vi?“ w ? 


respectively. Assume 
a,,4),4;,02 and o%, are unknown. The aim is to obtain the self-tuning fused Kalman 


signal smoother 55 (t—111) . 
The system (29) and (30) have the state space model 


x(t +1) = &(6)x(t) + Tw(t) (32) 
y,(t)= Ax(t)+v,(t), i=1,2,-+-,L (33) 
s(t) = Hx(t) (34) 


with the companion form 


-a, 


ay Lint) 


@= s7=[1 0 -- ola=ll o - Oo] (35) 


-a 0 - 0 


n 


where @ contains the unknown parameter vector 6 =[a,,a),---,a,|' . Using the above 
method, the optimal and self-tuning fused Kalman signal smoothers 5,(t¢—111) and 


So (t—11t) are given by 
So(t-11t) = Axy(t-11t), 55 (¢-11t) = Hx (t-111) (36) 


where xX,)(t—Ilt) and xj(t-11t) are the optimal and self-tuning fused Kalman 


smoothers for the system (29) and (30), respectively. 
In simulation we take that 


A(q”') = (1+ 0.6g7')+0.7q~)-0.8q7') =140.5q7! —0.62q7' — 0.336q° 
(37) 
Go; 215 6-02 2, =04 o,,=06 
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Applying the RIV algorithm, the estimate a, of a, (j =1,2,3) can be obtained. Based 


on the identification of model parameters, and based on the correlation method, the 
on-line estimators of the noise variances are obtained. The convergence of estimators 
of AR parameters and noise variances is shown in Fig.1-Fig.3, Fig.1-Fig.3 show the 
consistence of estimators of the model parameters and noise variances, where the 
straight curves denote the true values, the solid curves denote the estimators. The 
error curve between the self-tuning and optimal fused Kalman signal smoothers is 
presented in Fig.4, where we see that the self-tuning fuser converges to the optimal 
fuser, i.e. its error converges to zero. Therefore, the self-tuning fused Kalman signal 
smoother has asymptotic optimality. 


m ra 1 
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0.5 va ee ne o, V9. | 
AO wy) 
6 7 | 08} i eee ee Oe 
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Fig. 3. The convergence of ome Fig. 4. The error curves e(t) = $9 (t —11t) — Sy(t -111) 


5 Conclusions 


For the multisensor system with unknown model parameters and noise variances, a 
self-tuning information fusion Kalman smoother weighted by scalars for components 
has been presented by the classical Kalman filter method. By the system identification 
algorithm and the correlation function method, the estimators of the model parameters 
and noise variances are obtained. The proposed self-tuning fused Kalman smoother 
converges to the optimal fused Kalman smoother in a realization, so that it has the 
asymptotic optimality. 
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Abstract. In this paper, we propose a new algorithm adopting the spectrum 
sensing function of cognitive nodes, selections of cluster-heads and gateways 
can be made on the basis of dynamic information. Taking the cluster-head as 
anchor point, by choosing gateway nodes that are less affected by primary user 
in quest of connected dominating set, a virtual backbone network can be estab- 
lished. Based on the comparison of weights, through the sensing of the location 
of the primary user, the spatial distribution area of the cognitive nodes are di- 
vided and assigned different weights, thus realizing partition processing in the 
network. On one hand, it maintains relative stability of the sub-clusters in mo- 
tion; on the other hand, it reduces the probability of interference links. In the 
end, and simulation results confirm the verification of the algorithm. 


Keywords: Cognitive radio; Virtual backbone network; Topology; Clustering. 


1 Introduction 


Mobile ad hoc wireless networks (MANET) is a wireless communication systems 
which deploying in small domain without fixed infrastructure support. Traditionally, 
the network architecture are classified as the planar structure and the hierarchy struc- 
ture. Clustering is considered to the core technology to construct the management and 
transmission layer for the hierarchical networks. Although the tentative plan is ab- 
sorbing, it's the famous NP-hard problem in Graph Theory[1]. Nowadays, the virtual 
backbone networks construction methods can be divided into two groups, one is the 
centralized type, and another one is the distributed type. Zhao[2] design a distributed 
approximation algorithm to calculate the connected dominating set by choosing the 
one-hop gateways in the dense ad hoc networks. HAN[3] proposed a zone-based dis- 
tributed algorithm for CDS formation. IAN[4] puts forward the system architecture of 
the wireless networks being integrated cognitive radio technologies. The study about 
topology management is few now, in accordance with the new features of cognitive 
radio ad hoc network. 

The MANET networks integrated with cognitive radio(CR) support the capacity of 
temporarily borrowing the Primer User(PU)’s authorized spectrums as the additional 
communication bandwidth while the PU is off. But the system is demanded to switch 
to another available band. The spectrum handoff will result in some delay, because 
the CR user need to configure the parameters of the transmitter such as the operating 
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frequency and the modulation mode, to hold the current connections, and even to trig- 
ger the process of the routing refresh. In this paper, a novel method is proposed to 
building the topology with lower PU interferences and relative stability in motion. 


2 Related Convention and Mathematical Model 


Dominating Set. In graph theory, a dominating set for a graph G is a subset D of V 
such that every vertex not in D is joined to at least one member of D by some edge. If 
every proper subset of dominating set D is not the dominating set, D is said to be min- 
imum dominating set. 


Independent Set. An independent set is a set of vertices in a graph, no two of which 
are adjacent. A maximal independent set is an independent set such that adding any 
other vertex to the set forces the set to contain an edge. A maximum independent set is 
a largest independent set for a given graph G and its size is denoted a (G). 


Unit Disk Graph. In the simple undirected graph G , two vertices u and v are con- 
nected by an edge if and only if the distance between u and v is less than the definite 
value. The graph G is said to be Unit Disk Graph(UDG). 

Unit Disk Graph is used in computer science to model the topology of ad-hoc 
wireless communication networks, among which, vertex set represents the nodes and 
edge set represents the links. And the area within which a signal from one node can be 
received by another node is modeled as the circle.All above, the formation of virtual 
backbone network can be abstracted to the calculating approximated solution to the 
connected dominating set in the unit disk graph. 


3 Formation of Virtual Backbone Network 


3.1 Calculation of the Weight 


Taking the cluster-head as anchor point, by choosing gateway nodes that are less af- 
fected by primary user in quest of connected dominating set, a virtual backbone 
network can be established. The cognitive user gain the capacity, based on the PU 
location technology of cognitive radio[5], to estimate that PU in the shadow of it’s 
omni-directional antenna. Furthermore, the node judge whether it should cause the 
potential interference. Based on this, the vertex set V is divided into Vi , the set of 
nodes locate in the potential interference zone and Vn, the complementary set of Vi, 
V= Vi +Vn. For the nodes locate in the potential interference zone, select the nodes 
which one potentially bring less interference to play the role of cluster head and the 
gateway. For the nodes locate out of the potential interference zone, select the nodes 
which one taking on more stability to play the role of cluster head and gateway.For 
the sake of providing a easy solution based on open standards for the practice of 
weight comparison of the nodes in or out the potential interference zones, it is neces- 
sary that make the weights meet the totally ordering relation. We define the mathe- 
matic expression of the weight calculation as follows. In the cluster head electing and 
clustering stage: 
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In the gateway choosing stage: 
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Where vch is the cluster head, g is the gateway. fnode and the fgw are the weight cal- 
culation formulas, the function interfn and the function interfgw are used to estimate 
the interference degree of the node and gateway with PU, respectively.The function 
frode designed in this paper refers to the cluster availability, which is determined by 
the stability of every links between the current node and its neighbors in MANET. 
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Links Availability I; is defined as follows: 
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Where the Sm,n is the distance between the node m and n; ASm,n is the distance 
increment, ASm,n<0 denote the pair nodes keep the co-rotating motion, while the 
ASm,n >0 means them move away from each other; vmax is the maximum speed; At 
is the adjacent time interval between the changes of speed, supposing that speed and 
direction change only at certain point of time, and remain unchanged during a fixed 
period of time. The movement in time interval At can be predicted by the movement 
in At, which can be estimated by the signal intensity measured in the points T and 
T+A t. I]; is the Links Availability of current link, which is determined by both the 
prediction results of relative motion and the actual distance of the source and 
destination. Links Availability I; [0,1], the bigger the I; is, the more stable the link 
is. The function interf, in (1) is defined as follows, indicating the interference degree. 


interf,=1/dist(v. Vj) - (5) 


In terms of the Gateway, the functions fgw and interfgw in (2) is designed as follows, 
where the V2hop is the 2 hops gateway set and the V3hop is the 3 hops gateway set, 
the min refers to the minimum value, the dist indicate the distance between the current 
pair of nodes. 


iSt( 8 Vong) + dist 8.5 Vong) 8. EVonop 
foe h hd 2hop (6) 


7 min(dist(g,, V4.) + dist(g j.Vons)) + MIN(AiSt( 8; Vong) + diSt(¥ Vena) Bx © Vonop 


454 F. Wen-Jiang et al. 


fa- | V/dist(g,Vpy) 81 © Vohop oy 


Vdist(g;, Vpy y+ Vdist(g ,Vpy ) 81.8) © Vahop 


3.2. Algorithm Design 


The formation of virtual backbone network is the method based the mode that con- 
structing the connected dominating set in the basis of the maximum independent 
domination. The method is divided into two phases: first to solve the maximum inde- 
pendent domination, second to insert few nodes to connect the independent points, 
and to make the independent domination is connected. 

The clustering algorithm using staining patterns can be seen as the solution to the 
maximum independent domination. The nodes in the network are divided into three 
states, the cluster head, the cluster member, and the undecided state. Solution 
maximal independent set of algorithm flow is as follows. 


Algorithm 1. Solution maximal independent set 

1: Cluster heads extract the gateway status from the local topology information; 

2: The node under the primary user location information to determine if they 
located a potential interference area, and calculates the weight of its own for clus- 
ter head according to cluster stability or the interference degree, then broadcast the 
weight to all its neighbors; 

3: After get all neighbors weight, the node compare them with its own, if the 
current node hold the greatest one (In our design, the greater the weight is, the 
higher the priority is.), it upgrades to the cluster head state from the undecided 
state, and request the neighbors to be the cluster members in the cluster; 

4: For the node at undecided state and get requests, it chooses to join the cluster 
dominated by the cluster head with highest priority. Then, it broadcasts the state 
information; 

5: Repeat 3 and 4, till none of the nodes are marked as the initial state. 


Connected dominating set construction algorithm flow is as follows. 


Algorithm 2. Connected dominating set construction 

1: In the initial moment, all nodes are in the undecided state, and periodically 
send control messages with the basic information to discover the local topology 
situation; 

2: Cluster heads sort the candidate gateways in accordance with the principle that 
the gateway located out the potential interference area is more prior than the one in 
and the 2 hop gateway is more prior than the 3 hop gateway. For the gateways in 
same type, compare the weight described above; 

3: Cluster heads chooses the one with highest priority to mark as the gateway. 
The process is over, till every cluster head have and only have a gateway to the 
adjoining cluster head. 
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3.3. Proof of Correctness of Algorithm 
Lemma 1. Eventually the algorithm terminates. 


Proof. In the beginning, any node in the network is in the undecided state. The /; re- 
fers to the nodes in the undecided state set. Since every node can determine its cluster, 
the I; set will eventually become empty. Thus, the clustering phase will terminate. Af- 
ter clustering phase, it turns into the gateway selection phase. The cluster head with the 
lower ID of adjacent clusters should choose the optimal candidate gateway to mark as 
the gateway and notify the other of the result. The total number of nodes in the network 
is given, so the sum of cluster heads is limited. There will be one and only gateway 
between every adjacent clusters pair. The J; refers to the set of the adjacent clusters 
pair which have not chosen the gateway. J; set will be empty. Thus, the gateway selec- 
tion phase will terminate. The proposed algorithm will converge. 


Lemma 2. The algorithm obtains a connected dominating set under the connected 
graph condition 


Proof. Set / is a maximum independent set, which denoting the cluster head set. J is a 
dominating set. Any vertex v€ V-J, v is adjacent to one vertex u, u€ I. Assume that G 
is a simple connected graph, any vertex v€/J/, there is a vertex u, they are reachable for 
each other within 3 hops. In the proposed gateway selecting, every adjacent cluster pair 
will be inserted a gateway. So, if the vertex v and uJ, they are reachable for each 
other within 3 hops, there will be a gateway to connect them. The induced graph 
G(V’,E’) of G(V,E) is connected, set V=/+S,. It means that the reachability will 
not change after the algorithm execution. The set S.q= +S, is a connected dominating 
set. 


4 Performance Simulation and Results Analysis 


In our simulation, N mobile nodes were randomly and uniformly distributed in a 
plane area of XxY, the nodes could move in all possible directions with displacement 
varying uniformly within[0,Vmax], and the radius of antenna coverage is R. The si- 
mulation parameters have been listed in Table 1. The running time is 200s. 


Table 1. Simulation Parameter 


Parameter Meaning Value 

N Number of nodes 50 

XxY simulation area 100x100(m) 
Vinax Max speed of the node motion 2-18 (m/s) 
R radius of antenna coverage 10-60(m) 


Number of PU_ Number of Primer user 1-5 
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The number of potential interfering links. If one or both nodes the current link 
connecting is in the potential interference area, the link is considered to be cause a 
potential interference to the PU. A potential Interference link means that the spectrum 
has to hand off as soon as the primary user is working. 


Re-affiliations. it refers to the sum of the nodes whose logical relationship to its 
neighbor change caused by the relative motion per unit time. 
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Fig. 1. Number of Interfering links number VS number of PU 


Fig. | shows the variation of the number of interfering links with respect to the 
amount of PU in the different topology formation algorithms. From top to bottom they 
are: the max power topology, Lin-Gerla algorithm[6] with redundant gateway, Lin- 
Gerla algorithm with choosing the only gateway according to the lowest id 
principle[7], and the novel algorithm. As we know, the lower the amount of links is, 
the lower the number of interfering links is. The amount of interfering links of max 
power topology is the highest; the amount of the topology of Lin-Gerla algorithm with 
redundant gateway is lower than the max power topology because the links between 
the nods in the same cluster is wiped off. The cluster head election and gateway 
selection mechanism, which tends to select the nodes with high stability and far away 
from PU position as the member in virtual backbone network, of proposed algorithm, 
ensures the number of interfering links is the smaller than the Lin-Gerla algorithm. 

Fig. 2 shows the variation of Re-affiliations with respect to the max speed of 
motion. The novel algorithm, realizing partition processing in the network, can 
produce a stable cluster out the potential interference area, so that the re-affiliations 
per unit time are lower than Lin-Gerla. 

In the Fig. 3, the triangle indicates the PU, the circle indicates the cluster head, the 
cross-sing indicates the gateway, and the dot is the common node. The figure shows 
the novel algorithm maintain the connectivity of the generated topology, the structure 
is more sparse, and the nodes less play the role of cluster head and the gateway, in the 
vicinity of the PUs of cognitive. As the structure of the virtual backbone network is 
mesh-like, it can support the network layer to select the right path. 
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Fig. 2. Re-affiliations per unit time VS speed 


Fig. 3. Topology graph of the novel algorithm 


5 Conclusion 


This paper focuses on topology management based on clustering in cognitive MA- 
NET, combines PU location and spectrum sensing of cognitive radio and MENAT 
network topology management. Subject to cognitive MANET, by spectrum sensing 
and cognitive node positioning, a virtual backbone construction method is proposed, 
which is based on the space and time information of the primary user’s working sta- 
tus. Virtual backbone construction includes cluster head election and gateway selec- 
tion. Cluster heads are taken as anchor points, the connected dominating sets are 
obtained by selecting gateway nodes that are less affected by the primary user, and 
then the virtual backbone can be established. The results show that it can maintains 
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the stability while reducing the interference of the PU. We simulated and verified the 
proposed algorithm, and analyzed the relative results. 


Acknowledgments. This work is supported by the National Natural Science Founda- 
tion of China (60872038), and the Natural Science Foundation Project of CQ CSTC 
(CSTC,2009BA2064). 


References 


1. Karaki, J.A., Kamal, A.E.: Efficient virtual-backbone routing in mobile ad hoc networks. 
Computer Networks 52(2), 327—350 (2008) 

2. Zhao, C.X., Wang, G.X.: Gateway Selection Strategy in Dense Manet. Chinese Journal of 
Computers 28(2), 195-200 (2005) 

3. Han, B.O.: Zone-based virtual backbone formation in wireless ad hoc networks. Ad Hoc 
Networks 7, 183-200 (2009) 

4. Akyildiz, I.F., Lee, W.Y., Chowdhury, K.R.: Cognitive radio ad hoc networks. Ad Hoc 
Networks 7(5), 810-836 (2009) 

5. Ma, Z.Y., Chen, W., Cao, Z.G.: Analysis on Detection Probability Based Primary User Lo- 
calization Algorithm in Cognitive Radio Networks. Journal of Beijing University of Posts 
and Telecommunications 32(2), 14-19 (2009) 

6. Lin, R., Gerla, M.: Adaptive Clustering for Mobile. Wireless Networks 15(7), 1265-1275 
(1997) 

7. Stojmenovic, I., Seddigh, M., Zunic, J.: Dominating sets and neighbor elimination-based 
broadcasting algorithms in wireless networks. IEEE Transactions on Parallel and Distri- 
buted Systems 13(1), 14-25 (2002) 


Development of the Nerve-Central Listen System 
Based on Component 


Xu Dahua 


College of Engineering, Nanjing Agricultural University, 
Nanjing 210031, China 
xudahua@njau.edu.cn 


Abstract. Researching on component, a nerve-central listen system model was 
designed based on the HIS (Hospital Information System). Data collection, 
transmission and retrieval in the listen system was described. By comparison of 
video coding technology, selected a suitable coding technology for this system, 
and discussed the yawp removal technology to video data stream in the system. 
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1 Introduction 


Nerve-central listen system is a set of software and hardware systems, which is used 
to achieve the patient's physiological parameters for real-time monitoring and centra- 
lized management. Through the application of the system, patients’ life and physical 
parameters can be real-time viewed and took timely first-aid on critically ill patients, 
greatly reduce the intensity of nurses, improve rapid response capability on heavy 
patient condition monitoring. So the application prospect is very wide. The system 
has been put into use in many hospitals’ care units and worked very well. 

Component is a combination of a group of packages, and completes one or more 
functions on behalf of specific services, but also provides users with multiple inter- 
faces. At different levels, the components can combine the underlying logic into a 
larger particle size of high levels of new components, or even directly package into a 
system that allows reuse of modules from the code-level, object level, schema level 
to the system level are possible, so that the same software as the hardware can as- 
semble custom made submissive dream a reality. Using component-based system 
model can greatly reduce the system development cycle and improve code reuse and 
extensibility. 

The nerve-central listen system, as a technology project and business application 
project, has a wide theory and clinical application. The project will ultimately achieve 
in use in several hospitals. As different requirements required by hospitals, in order to 
enhance the system's flexibility and scalability, using a component-based develop- 
ment methodology can enhance the system scalability and creativity, and improve the 
theoretical research value and business value of the system. 
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2 Component Technology 


Component technology promoted the content of object encapsulates, it focused on the 
harmonious relations of complex components and stressed the existence of physical 
form in the environment. It has four basic attributes: 


(1) Component is independently configurable elements, the component must be 
self-tolerance. 

(2) Component stresses separation of the environment and other components, so 
component is the strictly packaged. The external has no chance or no necessary to 
know the internal implementation details. 

(3) Components can be complex used in an appropriate environment. The compo- 
nents need to provide a clear interface specification that can interact with the 
environment. 

(4) Component should not be continued (i.e. component has no individual-specific 
properties), it should not be distincted with its own copy, and only one copy at most 
in any environment. Easy to see, the components follow the package characteristics of 
the object, but not limite to an object, which can be encapsulated within one or more 
classes, the prototype object or process, the structure is flexible. Component high- 
lights the characteristics of tolerance and self-tolerance, that is the necessary features 
as a part of software product line. 

Viewed from the abstract level, object-oriented technology has reached a class- 
level reuse (code reuse), It was packaged in units of class. Such reuse granularity was 
too light to solve the heterogeneous interoperability and efficient reuse. Component 
would be referred to a higher abstractive level. Component is a combination of a 
group of packages, and completes one or more functions on behalf of specific servic- 
es, but also provides users with multiple interfaces. The components hide the concrete 
implementation, only provide services by interfaces. Thus, at different levels, the 
components can combined the underlying logic into a larger particle size of high le- 
vels of new components, or even directly packaged into a system that allows reuse of 
modules from the code-level, object level, schema level to the system level are possi- 
ble, so that the same software as the hardware can assemble custom made submissive 
dream a reality. 


3 System Architecture 


The system consists of the bedside machine for distributed monitoring and the central 
machine for centralized management. The central machine and the bedside machine 
are connected by the corresponding data lines to transfer data in two-way. The central 
listen system software partly includes the central machine’s control software and the 
bedside machine’s function software. in addition to the physical hardware connection 
to central listen system, but also the coordination of software. The central machine’s 
control software and the bedside machine’s function software each has a communica- 
tion software module responsible for the connection to ensure two-way data transmis- 
sion. Using RS485 to transmiss data is simple, easy and reliable, so we use serial port 
for data communication in central listen system. 
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The central machine and bedside machine were connected with RS485 cable 
through their respective configuration of serial communication ports to achieve a 
physical hardware connection. Using RS485 bus, as data transfer link, achieves half- 
duplex asynchronous communication. On the one hand, RS485 standard, as multi- 
point, differential data transmission of electrical specifications have become one of 
the most widely used in standard communication interface. This communication inter- 
face allowed simple multi-point, two-way communication on twisted pair, each 
terminal just hanging in the bus through an interface, this could achieve the real multi- 
point bus architecture. Its yawp rejection capability, and data transfer rate, cable 
length and reliability were unmatched by other standards. On the other hand, RS485 
standard only made regulations upon interface electrical characteristics, rather than 
connector, cable or agreement, users could create their own communication protocol 
on the basis. 

As the system is complex and reliability demanding, so we use component-based 
intelligent control technology, shown as in Figure | the central nervous control sys- 
tem topology architecture, effectively improve reliability and stability of the system. 
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Fig. 1. System Topology Architecture 


4 Bedside Machine Design 


Using industrial single board computers, communications components completed the 
calling provided by the MOXA serial port application development libraries, the 
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embedded process of bedside machine was designed with WINCE C language. The 
central processing components of bedside machine used interrupts to receive control 
commands sent by the central machine, and forwarded the command to the data ac- 
quisition components. When the central processing components organized data into a 
predetermined data packet. And then sent to the central machine through the serial 
port. When the system started to initialize the serial port (designated port communica- 
tion parameters and set the interrupt service routines), and then waited for the disrup- 
tion, control commands were received and processed by the interruption service 
components. 


Parameter Setting: EEG, 
ECG, blood pressure, 
respiration, temperature, oxygen, 
snoring, EOG, EMG, video 
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Fig. 2. Schematic of work of bedside machine 


In this system, many of the data involved. In all data to be collected and trans- 
ferred, video data transmission was more complex. The key coding techniques were 
MPEG-4 and H.264 compression technology, as H.264 encoding made the video 
compression decompression at less than the rate of 28.8Kbps ,so it could real-time 
mobile video capture, compression, decompression and playback. H.264 encode used 
unique P frames and B frames adaptive compression technology to further enhance 
the compression algorithm of the compression ratio, with IP multicast capabilities, 
greatly reduced the real-time transmission network bandwidth. When multiple control 
points work simultaneously, which caused transmission bit stream of data at the same 
time. The average bandwidth less than 250kbps, greatly reduced the input for user's 
network resource. The video streams of the system used H.264 and AVI data file was 
stored in the system, all the data files saved in binary form, not only guaranteed the 
data transfer speed, but also effectively reduced the amount of data acquisition sys- 
tems. System video processing core code was as follows: 
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AVIFileInitQ; = // AVIFILE Library of Initialized system 
AVIFILE af; // File pointer 

HRESULT hr; 

Hr=A VIFileOpen(&af, LPCTSTR(filename),OF_CREATE,NULL); 
AVISTREAMINEFO sthd; // Video data stream structure 
AVISTREAM as; // Video data stream interface 
SetRect(&sthd.reFrame,0,0,biout.biWidth,biout.biHeight); 


The video stream transmitted to the central machine through data lines, the central 
machine stored and analyzed the collection of data. The transmission process of video 
streams completed by a secondary thread and the secondary thread obeyed the RTP 
protocol, combined with asynchronous transfer mode and multi-buffer the video data 
stream real-time transmission, and thus to be a good solution to the local broadcast 
and network transmission time difference. 

The hospital environment was complex, the acquired video images existed yawp. 
In order not to affect system monitoring performance, we have developed an "Edge 
Enhancement Algorithm Components" in this system. Through this component to 
enhance images, the specific implementation was divided into the following two 
steps: 


(1) By morphological filtering the images with the Y component of YUV system, 
first needs to convert RGB image to YUV to obtain the image intensity values Y 
component. Then mathematical morphological filtered the Y component. 

(2) After morphological filtering on the Y component for median filtering and ma- 
thematical morphology filtering for Y component, it could eliminate a portion of the 
yawp and make the details of the images coming together. Then using the median 
filter method could eliminate the remaining noise in the image, make image smooth 
and loss of image detail was not serious. In order to preserve the details, use 3 x 3 
two-dimensional sliding window when making median filter. 


5 Central Machine 


The development of the central machine is the core of the whole system, more func- 
tions. According to parameters and the functions realized, we first used UML to mod- 
el and gradually refining the system's functions, made the system features modular, 
component-based, and finally got the system fully functional diagram and software 
component framework, and developed a component library management system. In 
the development process, reserved an interface for the existing HIS, so the system has 
good usability. When doctors, nurses used the system to access patient-related infor- 
mation, it could have direct access to the default modules, without the need go to find 
information such as patient records, this could save valuable time for first-aid and 
treatment for the patient and greatly improve the efficiency of the treatments. 

After designing topology architecture, based on the needs of the system, we de- 
signed a number of common components and a component library management sys- 
tem. This was available to project team members to use, not only improves efficiency, 
but also ensured uniform standards for each functional module control interface. More 
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importantly, it was convenient for the system development process. As demand 
changed, upgraded the system conveniently. We could achieve: single functional, 
common, simple interface, clear, highly independent. The component library man- 
agement system was completed for the following functions: find components, compo- 
nent expansion, component integration, component removed, component changes. 

"Patient Management Component" first entered the patient's basic information, se- 
lected the monitoring parameters, set the physiological parameter. Because of the 
reserved the HIS interface, we could enter the patient's hospital number to transfer the 
patient's basic information directly from the HIS system, and save the patient's physi- 
cal signs monitoring parameters into the HIS system. 

After setting physiological monitoring parameters, the real-time monitoring com- 
ponent started to work, where using multi-way monitoring and real time monitoring 
technique. Interface included digital display monitor, video display, parameter setting 
areas. Digital display and video image display area were floating windows, position 
could be adjusted and moved, without affecting the waveform display. Through set- 
ting the filter parameters, you could set the system state to monitor or record. On the 
monitoring screen, it could display from 4, up to 16 bedside machines’ all parameters. 
Each bedside machine could make video stream freeze, zoom and other operations, 
the system would real-time alarm for exceptional data, and display a specific excep- 
tion information or the patient's bed number. 


6 Summary 


The system was developed with technical innovations. The main innovative technolo- 
gy has the following two point: 1, Using the unique streaming technology, achieved 
real-time monitoring of unlimited number of remote sites and can view all the moni- 
toring data; 2, Using standard networking technologies to support IP unicast, multi- 
cast functionality, It may form point to point monitoring system, and also form a 
central monitoring system for each point; Setting a number of monitoring stations on 
the network to meet the needs of multiple users, and also formed multi-class monitor- 
ing system in WEB. With component design, system functions can be smoothly up- 
grade, as well as the interface with other medical systems, such as HIS, CIS and other 
system interfaces. The development and the successful commercial operation of 
nerve-central listen system, verified the correctness and feasibility of software devel- 
opment based on component. 
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Abstract. Strip shape is one of important quality indicators in cold strip steel 
production. The strip shape control system of six roller CVC rolling mill is 
studied in this paper. According to the measured data, the BP neural network is 
established, which identifies nonlinear part of the strip shape control system. 
Research results help a deeper study on the strip shape control system, which has 
important significance on obtaining good strip shape control. 
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1 Introduction 


The strip shape control is one of two important control quality indicators in the modern 
rolling production. Strip shape control directly influence end-product quality and 
market sales of the strip. Therefore, the development of strip shape control research has 
far-reaching significance to society and economic. At present, there are few studies on 
the strip shape control system through consulting a lot of related references. Therefore, 
this article analyses and researches the strip shape control system of six roller CVC 
rolling mill. 


2 Strip Shape Measurement 


According to the definition, the strip shape can be described as the flatness in direc- 
tion of rolling. The strip shape metering equipment of six roller CVC rolling mills 
adopts Siemens's BFI flatness measuring roll. There are mainly 5 kinds of actuators: 
the back-up roll depresses inclined, the intermediate roll bending, the working roll 
bending, the intermediate roll CVC shifting, and the working roll multi-area cooling 
system. 

The measure principle of strip shape is that different strip elongation results in dif- 
ferent tension of discrete points in the horizontal. It uses measuring roll to measure 
tension and indirectly measured strip flatness. Flatness measuring signal flow chart is 
as follow in figure 1. 
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Fig. 1. Flatness measuring signal flow chart 


3 Strip Shape Control 


The strip shape control system mainly includes two parts: learning efficiency of 
actuators flatness and actuator movement controller, control structure as shown in 
Figure 2. 
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Fig. 2. Flatness control structure 


The detailed organization diagram of flatness neural network control as shown in 
Figure 3,and NNI is the system identifier, Y is flatness actual value; Y,, is flatness 
setting value; Both Y and Y,, are vectors containing 52 elements; is the current 
flatness deviation; —_; is a flatness deviation which is determined by recently activated 
controls; Nzis flatness deviation which can be improved by the neural network iden- 
tifier(NNI) of actuator; P-P is the prior actuator efficiency; F is the current rolling 
force; Wis the current strip width; LP is the actual position deviation of actuator; g and 
p are efficiency which is in the direction of strap width including flatness deviation 
weighting factor and actuators; neural computing controller (NCC) is the controller 
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which is consist of the mathematical method. It sends out the optimal value to the 
actuator; E; is the actual improvement flatness deviation; E is the deviation between 
output value of network identification and the actual expected value, which is used in 
training the network. 
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Fig. 3. Flatness neural network control detailed diagram 


3.1 NNI Design 


3.1.1 Determination of Input and Output 

According to Figure 3, the neural network identifier can be achieved by using the BP 
neural network. According to the impact factor of NNI, the network inputs can be set to 
4: the current rolling force F; strap width W, optimal setting of actuator value U, and 
changes in the actual location of the executing agency LP. According to the function of 
NNI and the type of settings of secondary computer, the strip shape flatness deviation is 
used as the network output, which calculated by the direction of strap width of 52 
points, so the network outputs are 52 Real data. 


3.1.2 Determination of Hidden Layer and Nodal 

According to network input and output node number, and the training time of the 
network, the network is defined as the single hidden layer network structure. The node 
number is decided using the trial-and-error method, and finally determine the hidden 
node number are 16.The structure of the neural network identifier which is established 
as shown in Figure 4. 


3.1.3 Network Training 
Index function of the training network is: 


1 
E = — y > _ 1 
2p P k pn O on) : 


Where, p represents the training standard sample number; f,,,is network actual output; 
O,x is network expectation output. 
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Fig. 4. Neural network identification structure 


Use the gradient descent method to cause the minimum of the equation (1), namely 
using the generalized regular algorithm. The weight revision is: 


AW, (n+1) = 1-5 +0 AW, (n) 2) 


ji 
Where, W,j(n+J) is this time weight allowance; Wj(n) is previous time weight al- 
lowance; a is inertia factor, value between [0,1]; 77 is study factor, value between [0,1]. 


3.2 Calculation of the Optimum Value of Actuator 


NCC is the model, which is used to calculate the optimal settings of the actuator, the 
specific method is: when minimize the quadratic variance in the formula (3), corres- 
ponding U is the optimal settings of the actuator. 


‘i 2 
F=)'[g,(Ao, — p, (x) *U)] (3) 
i=l 
Where, x; is strap coordinate in width direction; 1 is the node number; g; is the deviation 
weight of flatness in position x;; p; is the efficiency of actuator in position x;; jis the 
flatness deviation in position x;; U is the optimum value of actuator which is going to be 
calculated. 

NCC is divided into five analysis modules in this project, corresponding to the five 
neural networks, which are used to calculate the efficiency of actuator. When using 
these five modules to calculate the latest setting value of the actuator separately, the 
order should be noted. The order is: the back-up roll depresses inclined, the interme- 
diate roll bending, the working roll bending, the intermediate roll CVC shifting, the 
working roll multi-area cooling system. Since the priority is different, so the output of 
the five actuators is different when the flatness deviation of input is different. For the 
back-up roll depresses inclined, the flatness deviation of input is the deviation between 
flatness observed value and the two-stage system flatness setting value; As for the other 
analysis modules which are used to compute the setting value of the other four actuator, 
the input is the deviation, removing the higher authority actuator corrected value from 
the smoothness deviation of first-level input (namely multiply efficiency by the most 
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superior setting value of this actuator). Then sending this differential value into the 
current actuator analysis functional model, the given quota of the current actuator will 
be obtained. 

In addition, the working roll multi-area cooling system is different from the other 
actuators after obtained the setting value of actuator. It directly sends into the tradi- 
tional PI regulator to adjust, while the output value of the other four flatness actuator 
need to be amplified and limited, then send to the implementing agencies. 


3.3 Strip Shape Control System Simulation 


The program is edited in MATLAB7.0 Environment in order to simulate the neural 
network model and mathematical model, and use the measured data to carry on off-line 
training. The predicted value of each actuator controlling amount is quite close with the 
display result of the live WinCC picture. Therefore, this model is correct. The WinCC 
configuration screen shows in figure 5. 
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Fig. 5. WinCC configuration screen 


4 Conclusion 


Using strip shape control system based on the neural network can greatly improve the 
control precision of strip shape in the rolling scene. Practice proved that the model can 
well control the flatness and the quality can completely achieve the anticipated goal, 
when the flatness of the coming strip meets the requirement. 
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Abstract. At present, furnace temperature settings are achieved by the operators 
in the rotary reheating furnace, and control accuracy is no good. The temperature 
prediction model of a tube billet surface at the exit based on RBF neural network 
and the modified particle swarm optimization are put forward to search the op- 
timal steady-state temperatures in this paper. The simulation results indicate that 
it meets heating process indicators and improve the heating quality and precision 
of the tube billet. The modified particle swarm optimization solves the local op- 
tima problem resulting from population degradation, and improves the search 
precision. 


Keywords: temperature control of rotary reheating furnace; RBF neural 
network; modified particle swarm optimization. 


1 Introduction 


In rotary reheating furnace, the temperature of the tube billet should be elevated up to 
intended temperature before its exit to the mill in order not to damage the rolling mill. 
Its heating quality, energy consumption and control level directly affect the quality, 
production and cost. At present, it mainly sets the temperature of each control zone by 
operators with experience in order to control the entire furnace process. But it is hard to 
meet billet heating process indicators and minimize energy consumption. Temperature 
optimal setting is a typical problem of optimal decision. CHAI et al. [1] established a 
temperature optimal settings model. It made use of the looking up table, a feedback 
model and the difference between temperature of the billet at the furnace exit and target 
temperature to calculate a modifier of temperature settings. Kim et al. [2] adopted the 
principal component analysis to classify the input data. A temperature settings model 
was established, which consisted of multiple experts neural networks and a door net- 
work. However, there are still some problems in these optimization methods. It is 
difficult to establish an accurate model because of creating a number of assumptions. 
Many of key factors are not available and so on. In industrial production, it limits the 
application of temperature optimal settings policy. 

The standard particle swarm optimization (PSO) algorithm has been used to search 
the optimal steady-state temperatures. It is easy to fall in local optima. Therefore, there 
have been some modified PSO algorithms in recent years, such as intelligent PSO [3], 
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fuzzy PSO [4] and other methods. It ensures the diversity of population, and improves 
the overall search ability of PSO. In the following pages temperature prediction model 
and modified PSO algorithm will be proposed. 


2 Configuration of Rotary Furnace Optimal Control System 


Configuration of rotary furnace optimal control system is adopted in figure 1. Where, g, 
d, v represent the tube billet type, diameter and rolling speed; charging temperature is 
room temperature; 7(T,,) is the temperature predicted value of the tube billet at the exit; 
T’ is the objective value; E is the difference between the temperature predicted and 
objective value; 7;(7) is a function of the temperature settings vector T,, in the distri- 
bution of time; 7; is a vector consisting of temperature feedback value of each control 
zone. Temperature controller adopts double-cross limiting PID controlling method. 


g.dv 


Temperatures settings te 
lookup table 


The method of Z-@¢)} Optimization of 
polynomial curve fitting steady-state ternperature 


Temperature prediction 
model of the round billet 


Fig. 1. Configuration of rotary furnace optimal control system 


The modified PSO algorithm is used to search the optimal steady-state temperatures 
settings vector T,-. According to g, d, v and temperatures settings lookup table with 
experience, it uses linear interpolation method to obtain the initial temperature settings 
vector T;,, which is used as the particles of initial swarm. It uses the minimum value of 
the objective function constituted by E and 7;(f) as a standard of searching the optimal 
steady-state temperatures. 


3 Temperature Prediction Model 


Heat transfer mechanism needs to create many assumptions, and many key factors are 
not available. So it is difficult to establish an accurate mathematical model [5], [6]. 
Therefore, this paper adopts radial basic function (RBF) [7] neural network to establish 
a temperature prediction model. RBF neural network is three layers forward neural 
network, which is composed of the input layer, hidden layer and output layer. It has 
features such as simple structure, short training time as well as nothing to do with the 
initial weights. Structure of RBF neural network is shown in Figure 2: 
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Fig. 2. Structure of RBF neural network 


Gaussian function of hidden layer is: 


u, =exp «| a armen | 


U 


Where, u; and o; are the output and standardized constant of the ith hidden node; T; is an 
input sample vector; g are the number of hidden layer nodes; c; is the center column 
vector of Gaussian function of the ith hidden node. 

The output of RBF neural network is: 


q 
T(r, )= Yau, -6 
i=l 


Where, 7(T;) represents output of output layer; @; is the weight coefficient from hidden 
layer to output layer; 0 represents threshold of hidden layer. 

RBF neural network model can accurately reflect the heat transfer process of the 
billet. Temperature of the tube billet surface 7(T;) at the exit is used as the only one 
output. Each zone temperature 7; is the input of the network. Where, me 1,2,...,7. 


4 Optimization of Steady-State Temperature Setting 


Each zone steady-state temperature setting is the key to ensure the heating quality of the 
billet. Billet temperature should reach a specified temperature at the exit, meet the 
requirements of rolling process, and minimize energy consumption. Thus, the optimal 
objective function is: 


1 eee 
JD,.) = 5 PUL.) [+50 [Tat 


When 7;, minimizes optimal objective function, the 7, is equal to 7T;,*. According to 
practical experiences and process analysis, T;(t) can be described as a quadratic func- 
tion of 7, in the distribution of billet heating time. It can be obtained by the method of 
polynomial curve fitting. Where, P, Q are weighing coefficients; 7, is the heating time 
of billet. 
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4.1 Standard PSO Algorithm 


The PSO [8] algorithm was first proposed by Eberhart and Kennedy in 1995, inspired 
by the natural flocking and swarming behavior of birds and insects. Each individual in 
particle swarm, called a “particle”, represents a potential solution which moves its 
position in search space and updates its velocity according to its own flying experience 
and that of its neighbor’s, aiming for a better position for itself at the next move. 

It is assumed below that the swarm consists of m particles. Thus, ie 1,...,m. During 
each of iterations, every particle in the swarm is updated using equations (1) and (2). 
Two pseudorandom sequences ¢ € Rand [0,1] and 7 € Rand [0,1] are used to affect the 
stochastic nature of the algorithm. For all dimensionse I,..., D, let xip, pin, Vip and Pen 
be the current position, current personal best position, velocity of the Dth dimension of 
the ith particle, and global best position, respectively. The velocity update step is ac- 
cording to Eq. (1) as follow: 


k+l ok k k k k 
Vip = Win +65 (Pip — Xin) +C,7(P yp — Xin) () 
The new velocity is then added to the current position of the particle to obtain its next 
position as follow: 
k+l k k+l 
Xip =Xipt Vip (2) 
Where, the acceleration constants c/ and c2 control how far a particle will move in a 
single iteration. The inertia weight @ is used to control the convergence behavior of the 
PSO. In general, the inertia weight @ is set according to the following equation: 
Ornax —~ Oni 
O= O,,. ~-—*——™ xk 
1eN.ax 


Temperature settings of seven control zones constitute the position of the particle. Ve- 
locity of the particle is used to search the optimal settings vector. The reciprocal of 
optimal objective function plus 0.1 is used as the fitness function. Standard PSO algo- 
rithm has a quick convergence rate in the early stages, but it is easy to fall in local optima 
in the later stages. In this paper, modified PSO algorithm will solve this problem. 


4.2 Modified PSO Algorithm of Velocity and Position Mutation 


Standard PSO algorithm has a lot of defects. Thus, a modified PSO algorithm is pro- 
posed in this paper, and mutation [9] strategy is introduced into it. 

The variable object of the modified PSO algorithm is current velocity and position of 
the particle swarm. The variation can happen during each of iterations and each di- 
mension of the particle, and break up centralized particles. It increases the diversity of 
population, and avoids the phenomenon of premature convergence. The modified PSO 
algorithm is as follows. First, initialize the position vector and associated velocity 
according to the temperatures settings lookup table, and evaluate the fitness of each 
particle via the fitness function. Velocity and position vector are reinitialized in each 
dimension on certain mutation rate. Second, compare the particle’s fitness evaluation 
with the particle’s previous best solution and population’s global best in order to search 
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population’s best solution, which is the optimal steady-state temperatures settings 
vector. Then, change velocities and positions by using Eq. (1) and (2). Finally, output 
population’s global best solution and corresponding fitness value at the end of the 
iterations. 


5 Verification and Discussion 


In a seamless steel tube plant, rotary reheating furnace can be divided into seven control 
zones: a preheating zone, five heating zones, and a soaking zone. The charging tem- 
perature are 20°C, the temperature of the tube billet at the exit should be between 
1,200°C and 1,290°C. It requires that the surface temperature error of the predicted 
values and measured values should be less than 30°C. 


5.1 Verification of the RBF Neural Network Model 


The steel grade is 20CrMo steel. It selects 50 groups of data as a network training 
samples, and the other 25 groups of data are used to verify the model. Figure 3 shows 
that temperature prediction error of the tube billet surface at the exit is less than 30°C, 
and it meets the requirements of the heating process. 
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Fig. 3. Predicted error of RBF neural network 


5.2 Verification and Discussion of Modified PSO Algorithm 


Preferences of modified PSO algorithm is: population sizes are 30; iteration times are 
100; cl=c2=2; mutation rate is 0.5. The target value is 1230°C. For the modified PSO 
algorithm, the bias is only the 16.197% of manual value, which is much less than the 
manual operation as shown in Table 1. Obviously, it can meet the tube billet heating 
process indicators and improve the heating quality. 
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Table 1. Surface temperature estimation results of a round billet at the exit 


Methods The difference between the target and the predicted value 
Manual operation 8.15 
Modified PSO algorithm 1.32 


The modified PSO and the PSO have the same parameters settings. Figure 4 shows 
that the improvement of the PSO has no significant change after the 13th generation, 
while, in modified POS algorithm, it can be improved to 29 generations. It can be seen 
that, the modified PSO can solve the local optima problem caused by population de- 
gradation of the standard PSO algorithm, and improve the search precision. 
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Fig. 4. Evolutionary curves of modified PSO and PSO 


6 Conclusion 


The RBF neural network model is established in this article, and the modified PSO 
algorithm is used to search T,,.*.The simulation indicated that the prediction error 
satisfies the request of the heating process, and this method is superior to manual op- 
eration and improves the heating quality. The modified PSO algorithm breaks up cen- 
tralized particles through mutating velocity and position, and improves the overall 
search ability and search precision. 
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Abstract. To rescue and evacuate people of unexpected events in city, it is very 
necessary to organize traffic system. On the base of analyzing traffic characte- 
ristics of unexpected events, traffic evacuation methods are submitted, such as 
combination of one-way traffic and special use way, no turn -left, temporary 
traffic channeling, one-way green wave, and estimation methods also are set. 
Estimation result illustrates that these traffic evacuation methods of unexpected 
events can improve network capacity and promote rescue and evacuation. 


Keywords: traffic engineering, unexpected events, traffic evacuation, one-way 
traffic, traffic channeling, traffic estimation. 


1 Introduction 


Emergency in city causes the drastic increase of traffic volume in local area within a 
short time, resulting in intensifying contradictions between traffic supply and demand, 
the formation of traffic jams, causing traffic chaos and creating difficulties for rescue 
and evacuation, having a great impact on normal traffic flow. However, the impact 
caused by this incident is temporary, therefore, generally it can not be solved simply 
relying on increasing road capacity (road building)[1,2], because this solution not only 
works very hard but also is uneconomical. Throughout the active respond to emergency 
effectively and successfully at home and abroad, it can be discovered that the key of 
relief the contradiction between supply and demand after the incident and ensuring a 
successful response to emergencies is to formulate and implement the traffic evacua- 
tion plan of emergency systemically, scientifically and rationally[3]. 

Traffic flow of emergency in city has focused on the characteristics of one-way, and 
it can easily form traffic shock waves which can result in concentrated outbreak of 
traffic demand in the local region, causing traffic congestion and traffic chaos. 

After analyzing and studying the traffic characteristics of unexpected events and 
various traffic evacuation methods, the author submits the traffic evacuation methods 
of unexpected events, such as combination of one-way traffic and special use way, 
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changing the way of turn-left in intersection, temporary traffic channeling, one-way 
green wave, and evaluation methods of traffic evacuation programs of unexpected 
events also are set. 


2 Traffic Evacuation Methods of Unexpected Events 


2.1 Combination of One-Way Traffic and Special Use Lane 


After the occurrence of emergencies in city, in order to evacuate the largest crowd in 
the shortest possible time, one-way traffic is the ideal method of traffic evacuation. 
Evacuation people of emergency with the characteristic of one-way evacuation, which 
coincides with the one-way traffic, so one-way traffic in the direction of the evacuation 
capacity than the two-way traffic could double[4] to speed up evacuation. 

But during the evacuation, we must make sure that a variety of special vehicles 
(ambulance, fire engines, etc.) can be fast and convenient access to event area. 
Therefore, special use lane must be opened up in the evacuation road, exclusively for 
dealing with emergency services vehicles, such as engineering emergency, fire rescue, 
ambulance or the implementation of urgent official matters by police. Any community 
vehicle is prohibited from entering, or staying in the lane for various reasons. In the 
period of public emergency evacuation, in the emergency special use lane, evacuation 
of the provisions only allow the rescue vehicle traffic and other vehicles can not enter 
the special lanes. 

Based on the above analysis, after the unexpected incident, the method of traffic 
evacuation that can ensure emergency vehicles to enter the incident area to perform a 
rescue while can evacuate the crowd as soon as possible is the combination of one-way 
traffic and special use lane, shown in Fig.1. 


Emergency Driveway —<<$<$—<—_—__=_=___j> 


Fig. 1. Combination of one-way traffic and special use lane 


2.2 Change the Way of Left Turn in Intersection 


Vehicles entering and leaving the intersection, due to the different moving direction, 
the conflict between vehicles is not the same way, resulting in confluence points, di- 
verging points and crossing points! 5] In the three types of conflict points, the points of 
crossing are produced by straight vehicles, turning left vehicles, turning left and going 
straight, turning left vehicles have the greatest impact on safety of traffic interference 
and running, followed by the confluence point, again, is the diverging point. 
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Intersections without traffic control, there are all kinds of conflict points. Their 
quantity is increasing significantly with the increase of lanes of roads that are crossed 
and the crossing points grow fastest among them. The crossing points are produced 
most by left-turn vehicles. Left-turn traffic at intersections is a major factor in the 
formation of conflict points and the main reasons of the formation of traffic conflict. 
Therefore, the rational evacuation of left-turn traffic is a effective method to ensure 
traffic safety and to improve intersection capacity[6]. 

After the occurrence of emergencies in city, in order to evacuate the largest crowd in 
the shortest possible time, the most effective way of evacuation methods of intersection 
is to prohibit turning left and it is feasible in the incident area. Because the evacuation 
of people with one-way evacuation will leave the incident area as soon as possible 
along the direction of evacuation, they need a very small chance of turning in the in- 
cident area. Prohibition of left-turn implemented, the reduction in the number of con- 
flict points at intersection will reduce the level of traffic conflict so that the capacity of 
intersection is greatly improved, especially the straight direction. 

The management of prohibition of left-turn implemented, left-turn lanes or 
straight-left lanes are changed directly to straight lanes. Because the headway of 
straight traffic is different from that of left-turn vehicles, according to the formula for 
calculating capacity, its capacity will be significantly changed. For non-signalized 
crossing, left-turn traffic has no impact on the straight vehicles so that the capacity will 
increase significantly. For signalized intersections, the implementation of left-turn ban 
will reduce the number of intersections of the phase, so that a cycle of straight traffic 
can be assigned to more passing time and there is a significant increase in traffic ca- 
pacity. In the sudden incident, people expect to leave the danger area away as soon as 
possible for their fear psychology, while left-turn traffic is the culprit of congested 
intersection and the production of delay. Therefore, in the event of an emergency, 
measures taken to prohibit left-turn can effectively improve the intersection capacity 
and reduce intersection delays. 


2.3. Temporary Traffic Channeling 


View of the temporary characteristics of traffic evacuation management measures of 
unexpected events, on the basis of using existing channeling facilities, facilities of 
temporary traffic channeling should become the main ones, such as barrier, separator, 
divided blocks, marking, temporary traffic signs, etc. 

For serious consequences after the emergency, it will be greatly inconvenient for the 
crowd in chaos, if there have been no temporary traffic channeling. For example, road 
traffic accident occurred, when the accident causes traffic disruption, then you can 
organize the traffic by means of temporary channels. Isolate the scope of the accident 
with barrier to ensure the on-site for sampling and to judge cause of the accident and the 
responsibility. And set up temporary signs at the intersections which are before and 
after the accident area to inform past drivers of the road break, please bypass to avoid 
congestion on the road. Showed by Fig. 2. 

The traffic can be orderly by using of temporary traffic channeling mode and 
evacuation will achieve more with less. 
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Fig. 2. Sketch of temporary traffic channeling 


2.4 One-Way Green Wave Control 


The main road in the urban road traffic network is the aorta which ensures the normal 
operation of urban socio-economic activities. A main road traffic running condition 
will directly affect the road traffic condition that it covers the large areas around. When 
signal control of intersection along the main road in the city done, in order to maintain 
the continuity of the main road traffic to the extent possible, the intersection along the 
main road should be managed group by group, based on road and environmental con- 
ditions, namely, urban trunk road section traffic signal base is implemented system 
control characterized by coordinated form[7,8]. 

On the collector-distributor roads that public emergencies in the city mainly occur, 
traffic signal coordination control is a effective measure to improve the road capacity 
within the event. After the incident, traffic flow on the road with very obvious cha- 
racteristic of uneven direction, presenting a one-way flow. According to this feature, 
the “Green Wave Control” can implement “one-way Green Wave Control” different 
from the conventional “two-way Green Wave Control”. 

After the occurrence of emergency in the city, consideration at that time is how to 
evacuate the crowd quickly in the shortest possible time. When organizing traffic by 
using one-way green wave, there is no need to take road speed into account and just the 
distance between adjacent intersection and intersection signal cycle are needed to 
consider to build trunk one-way Green Wave Control. Therefore, it is more simple than 
the two-way Green Wave Control, and more quickly and efficiently, and it can eva- 
cuate the largest crowd in the shortest possible time. 


3 Evaluation of Traffic Evacuation Methods of Unexpected Events 


It can be seen from the above analysis that measures applied to public emergency in the 
city are mainly one-way traffic, prohibition of left turn, temporary traffic channeling 
and one-way green wave control, of which one-way traffic is the more core content. 
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Measures of prohibition of left turn and temporary traffic channeling are both aimed at 
the implementation of one-way traffic and one-way Green Wave is on the basis of 
one-way traffic. 


3.1 Intersection Capacity of Traffic Evacuation of Unexpected Events 


As we all know, the greatest impact on traffic in the intersection is the conflict point 
which caused by turn left and straight car. After the one-way traffic the number of 
conflict points significantly reduced, as shown in Fig.3, which undoubtedly will im- 


prove the intersection capacity. 
= . ae 


red a 
(c) 
Fig. 3. Traffic conflict of one-way traffic 


gree 


Pe 
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Following the intersection capacity on the one-way traffic signal controlled was 
calculated by using the stop line method according to the “urban road design standard”. 
Design capacity of straight drive was calculated by the following formula: 


N, = 3600x@,[(t,-t,)/ t,, +H/t, (1) 


Where N , —The design capacity of one straight lane, 

t.—Signal period, 

Ly —tThe green time in one signal period, 

t, —The time when the first car start and through the stop the line after the 
green light on, the value can be used 2.35, 

t,, —The average interval of straight or right vehicles through the stop 
line, 

~, —The reduction factor of capacity of straight drive traffic, the value 


can be used 0.9. 


In the case of Figure 3 (b), the design capacity of imported road A is 
N,=N,,+N,,, because of N, =N, =N,, 


So N, =2N,, 
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Where N,, —The design capacity of one straight and left lane, 


N ,, —The design capacity of one right lane. 

In the case of Figure 3 (c), the design capacity of imported road B is 
N, =N,, +N,. Because of taking the one-way traffic, the capacity of imported 
road C is OQ and the design capacity of imported road D_ is 
N,=N, =N(EB/2. 

Where , is the percentage of the left turn car in the right and left lane before taking 
the one-way traffic control. 


The design capacity of one-way traffic intersection is calculated by the formula (2): 


Nonoumy =N,+NytNo+Np =2N, +N, +04+N (EB /D=N,4B/D (2) 


oneway 


But Noy a Ny +N,(-f,/2D 


Where N ,,, —The design capacity of one straight, left and right lane; 
So the capacity of a normal cross intersection of signal control is calculated as 
formula (3): 


N 


normal 


=N,+N,+N +N, =4N,, + N,4 28) (3) 


After the intersection is changed into the one-way traffic, the improved percentage of 
the capacity of intersection is expressed as formula (4) : 


oneway N normal B 1 


N x 100% = ——_—_ x 100% (4) 


normal N 46 1 
According to the above formula (4), it can be seen that the greater the proportion of the 
left turn cars in intersection, the greater the rate of increase of the intersection capacity 
after the intersection changed into a one-way traffic. 


A= 


3.2. Section Capacity of Traffic Evacuation of Unexpected Events 


After using the evacuation of one-way traffic, the design capacity of road section is: 


The possible capacity of a driveway is N gS ,where f, is the headway. 


The design capacity of driveway affected by the intersection is: 
Np =a.°A4°N, (5) 


Where: & .—Road classification factor of capacity of driveway; 


a , —Influence factor of intersection. 
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Where a, = 


Where: /—The distance between the intersection; 
v—design speed of section; 
a—tThe average acceleration of start: 
b—The average deceleration of braking; 


t .—The waiting time r of vehicles between intersection. 


It is not difficult to see from the formula (5) , in the case of certain other conditions, 
two-way traffic changed into one-way traffic, due to the improvement of intersection 
capacity, the waiting time of vehicles at intersection is reduced, resulting in the im- 
provement of road section capacity. In addition, because two-way traffic is changed 
into one-way traffic, one-way traffic direction on the number of lanes increase and the 
road section capacity will be significantly enhanced also. 


4 Conclusion 


Environment and energy disasters and emergencies brought by densely populated 
cities, frequently economic activities and all types of dense buildings are threatening 
the production and life of urban residents continually, even life. More and more fre- 
quent emergencies are testing the emergency response capacity of modern urban 
managers increasingly. The traffic in response to unexpected events plays a key role. 
As sudden changes in traffic flow caused by the emergency, the conventional traffic 
evacuation is very difficult to adapt to changes in traffic flow. There is need to research 
traffic evacuation methods for emergencies according to changes in traffic flow cha- 
racteristics. This article is based of unexpected events traffic flow with characteristics 
of one-way evacuation, proposing traffic evacuation methods that fit the characteristics 
of one-way evacuation ,such as one-way traffic, special road, organization of intersec- 
tion turning, one-way Green Wave and so on. It is hoped that it can provide a reference 
for the traffic management department and lay the foundation for the development of 
traffic emergency plans. 
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Abstract. Image registration is playing an increasingly important role in mod- 
ern medical image processing. It refers to the process of overlaying two or more 
images of the same scene taken at different times, from different viewpoints, 
and/or by different sensors. During the last decades, a large number of image 
registration methods have been described in the literature. Unfortunately, there 
is no one method that works very well for all applications. In this paper, we 
propose a new approach using improved Particle swarm optimization for medi- 
cal image registration. The algorithm has been successfully used for medical 
image registration. The feasibility of the proposed method is demonstrated and 
compared with the standard PSO based image registration technique. The expe- 
rimental results show that the propose method better results than the standard 
PSO method. 


Keywords: image registration; improved particle swarm optimization; mutual 
information. 


1 Introduction 


Medical imaging is about establishing shape, structure, size, and spatial relationships 
of anatomical structures within the patient, together with spatial information about 
function and any pathology or other abnormality. Establishing the correspondence of 
spatial information in medical images and equivalent structures in the body is funda- 
mental to medical image interpretation and analysis. In many medical clinical applica- 
tions, images of similar or differing modalities often need to be aligned as a prepro- 
cessing step for many planning, navigation, data-fusion and visualization tasks. This 
alignment process is known as image registration. 

Since the mid 1980s medical image registration has evolved from being perceived 
as a rather minor precursor to some medical imaging applications to a significant 
subdiscipline in itself. Entire sessions are devoted to the topic in major medical imag- 
ing conferences, and workshops have been held on the subject. Image registration has 
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also become one of the more successful areas of image processing, with fully auto- 
mated algorithms available in a number of applications [1, 2, 11]. 

Medical image registration has a wide range of potential applications. These 
include[1]: 


1) Combining information from multiple imaging modalities, for example, when 
relating functional information from nuclear medicine images to anatomy delineated 
in high-resolution MR images. 

2) Monitoring changes in size, shape, or image intensity over time intervals that 
might range from a few seconds in dynamic perfusion studies to several months or 
even years in the study of neuronal loss in dementia. 

3) Relating preoperative images and surgical plans to the physical reality of the pa- 
tient in the operating room during image-guided surgery or in the treatment suite 
during radiotherapy. 

4) Relating an individual’s anatomy to a standardized atlas. 


Image registration is the process of overlaying two or more images of the same scene 
taken at different times, from different viewpoints, and/or by different sensors [3]. In 
recent years a number of image registration approaches have been devised. These 
techniques can be divided into feature-based techniques and intensity-based tech- 
niques. Feature-based techniques require some preprocessing, prior to registration, to 
extract relevant information, like anatomical land-marks, edges or shapes. In contrary 
to feature-based techniques, intensity-based measures get by without prior preprocess- 
ing. Thus images can be registered right after image acquisition. Intensity-based 
measures use the full raw image information for image alignment. Here we adopt the 
latter approach. 

The focus of the current paper is the search strategy (optimization) for maximizing 
the similarity metric for registering images. This paper focuses on a new improved 
PSO technique. To the authors’ knowledge, this technique is relatively recent and is 
better know for optimization but has not been applied to medical image registration 
previously. In the following sections, we will introduce this method in more details, 
and then apply it to medical image registration. Theoretical analysis and experiments 
show that this method is effective and accurate to register medical images. 

This reminder of this paper is organized as follows. We first introduce the im- 
proved particle optimization algorithm used as optimization method in section 2. 
Then we introduce the mutual information used as registration criterion of two images 
in section 3.Our experiments and discussion is in section 5. 


2 Particle Swarm Optimization 


Particle Swarm Optimization (PSO) is a recently proposed algorithm by James Ken- 
nedy and R. C. Eberhart in 1995, motivated by social behavior of organisms such as 
bird flocking and fish schooling[2]. PSO algorithm is not only a tool for optimization, 
but also a tool for representing sociocognition of human and artificial agents, based 
on principles of social psychology. The particle swarm concept originated as a simu- 
lation of simplified social system. The original intent was to graphically simulate 


Medical Image Registration Based on Improved PSO Algorithm 489 


the choreography of bird of a bird block or fish school. However, it was found that 
particle swarm model can be used as an optimizer. During the past few years, PSO 
has been successfully applied to multidimensional optimization problems, artificial 
neural network training, and multiobjective optimization problems. 

Particle Swarm Optimization optimizes an objective function by undertaking a 
population-based search. The population consists of potential solutions, named par- 
ticles, which are metaphor of birds in flocks. These particles are randomly initialized 
and freely fly across the multi dimensional search space. During flight, each particle 
updates its own velocity and position based on the best experience of its own and the 
entire population. 


2.1 Original Particle Swarm Optimization 


Mathematical notation of original PSO is defined as follow: 


An individual particle i is composed of three vectors: its position in the D- 


dimensional search space x, =(%)1,%X;7,--Xjp) » the best position that it has 
individually found PD; =( Pio Piao Pip) ; and its velocity 


v, = (VasVissiaVind) Particles were originally initialized in a uniform random man- 


ner throughout the search space; velocity is also randomly initialized. 

These particles then move throughout the search space by a fairly simple set of up- 
date equations. The algorithm updates the entire swarm at each time step by updating 
the velocity and position of each particle in every dimension by the following 
rules[4]: 


Vig =Viq tC, *rand()* (Pig — Xig A Cy * Rand()* (Pea = 555) (1) 


Xia = Xia + Via (2) 


where in the original equations C, and C, are a constant with the value of 2.0, rand 
and Rand are independent random numbers uniquely generated at every update for 
each individual dimension d = | to D, and P, is the best position found by any 


neighbor of the particle. 


2.2 Improved Particle Swarm Optimization 


Particle swarm optimization for image registration was introduced in [5]. The basic 
PSO algorithm can be enhanced in a variety of ways. 

Although the original PSO algorithm has ability of exploiting the global maximum, 
it can not guarantee to achieve the maximum but often falls into local maxima. To 
overcome this shortcoming, In this paper, we use the following equations to instead 
equation (1) and (2)[6]: 
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Vi, =W*v,, +c, *rand()*(p,, —X,7) 


3 
+0, * Rand ()*(D oq —Xy) ae 


Xia = Xia T Via (4) 
w=u+0* N(O,1) 
had = Hin + CLS — Manin ) ‘3 rand (0,1) 


In the above equations, N (0,1) is a random numbers drawn from the standard nor- 
mal distribution. 


For each particle 
Initialize particle 
For each particle 
Calculate fitness value 
If the fitness value is better than the best fitness 
value (pbest) in history 
set current value as the new pbest 
End 
Choose the particle with the best fitness value of all 
the particles as the gbest 
For each particle 
Calculate particle velocity according to Eq.1.3 
Update particle position according to Eq.1.4 
End 
Continue while maximum iterations or minimum error 


criteria is not attained 


Fig. 1. The Pseudo code of the improved PSO procedure 


3 Mutual Information 


Mutual information(MI) is a basic concept originating from information theory, mea- 
suring the statistical dependence between two random variables or the amount of 
information that one variable contains about the other. Mutual information is an in- 
formation theoretic topic that has quickly become one of the most popular techniques 
available for use in image registration[9]. 

MI originated many decades ago from works based on entropy and its roots stem 
from information and communication theory. However, it was first proposed as a 
registration measure in medical image registration in 1995, independently by Viola 
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and Wells[7] and by Collignon[8]. The mutual information I of two images X and Y 
is defines as 


1(X ,Y)=H(X)+H(Y)-H(X,Y) 


where H(X ) and H(Y) are individual entropies and H( X,Y) is the joint entro- 


py. For two images, the mutual information is computed from the joint probability 
distribution of the images’ intensity or gray-values. When two images are aligned, the 
joint probability distribution is “peaky” resulting in a high mutual information value. 
The greater the value of MI, the better the match between the two images, so image 
registration becomes a typical maximization problem. The definitions of these entro- 
pies are 


H(X) =->)P, (x)log Py (x) 
H(Y) =-)°P,(y)log P,(y) 


H(X,Y) =->) Py (x, y)log Py (x, y) 


Ky 


where P(x) and P,(y) are the marginal probability mass functions and 


ry (xX, y) is the joint probability mass functions. 


The success of MI registration lies in its simplicity as it is considered to be quite a 
general similarity measure. It makes very few assumptions regarding the relationship 
that exists between different images. Assumptions regarding linear correlation or even 
functional correlation are not made. 


4 Image Registration Model 


Fig. 2 shows the framework of our proposed method in which the optimizer is based 
on the Improved Particle Swarm Optimization(Eq. 3 and Eq. 4). 


Fixed image a MI Metric 


v 
A 


Optimizer 


Moving image > Interpolator 


4 


Transform ~t 


Fig. 2. Framework of our proposed method 
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5 Results 


In order to evaluate the proposed PSO-based image registration, we compared the 
proposed method with original pso method-based image registration. 

In this section, we perform several registration experiments with medical image da- 
ta to evaluate the performance of the proposed technique. Moreover, we also perform 
conventional PSO for comparison. 


(a) target image (b) floating image 


Brain CT data and MR data are used as the reference image and the test image, 
respectively. 


Table 1. Registration results of experiment 1 


T, T, a) 
Ground truth 7 0 2 
Standard PSO 9 1 0 
Our algorithm 7 0 2 


(a) target image (b) floating image 
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Table 2. Registration results of experiment 2 


T, T, 0 
Ground truth 13 17 10 
Standard PSO 20 14.5 6.4 
Our algorithm 13.1 15.7 10.1 


6 Conclusion 


In this paper we have presented a new method for the registration of medical images 
which is based on the combination of mutual information and a new improved PSO 
technique. Results show that the proposed technique is better than the original PSO 
technique. According to the experiment, we can conclude that the method proposed is 
more effective than the traditional PSO method. 
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Abstract. In this paper, the principle and characteristics of friction drive is ana- 
lyzed. The mathematical model of the LAMOST telescope mount drive control 
system is established and the fuzzy sliding mode control theory is put forward to 
contrast with traditional PID control and ordinary sliding mode control, which is 
realized by Matlab/simulink simulation. The simulation results show that the 
fuzzy sliding mode control has good performance of speed tracking. 


Keywords: servo system; sliding mode; fuzzy sliding mode. 


1 Introduction 


Friction is a common physical phenomenon, which exists in all sports. In servo system, 
friction that can cause some adverse impacts of limit cycle oscillation and low speed 
creeping of system is a major factor affecting the low-speed performance. The impacts 
can be reduced by changing the mechanical properties or using high-precision con- 
troller, considering the control algorithm is a better choice to inhibit the impact. Sliding 
mode control, which has strong insensitivity to the model error and parameter changes 
of and external disturbance, has strong robustness. The simulation on Matlab/simulink 
proves the superiority of fuzzy sliding mode control. 


2 The Structure of LAMOST Mount Drive Servo System 


LAMOST mount drive servo system is a two-axis servo system which adopts friction 
drive and uses permanent magnet DC torque motor to drive. The control model estab- 
lished according to the laws of kinematic is shown in Figure 1. 

In Figure 1, w represents the control input signal; L and R respectively represents the 
total inductance of armature circuit and the total resistance; K, represents the motor 
torque coefficient; T,,, represents motor shaft torque; T,, represents the equivalent torque 
which converted on the motor shaft; J., represents the equivalent moment of inertia 
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which converted on the motor shaft, N represents the friction drive reduction ratio; K, 


represents the inverse EMF constant; S represents the Laplace operator; 6 represents 
the speed; 6 represents the tracking position. 

The state space equation of the telescope mount drive control system can be obtained 
as follows: 


(x, -T,,) () 


x3 = 2 (Ku K,K.Nx~ Rx) 


- a =a 1 > K Tn a | : >| | Ay} 1 a, 
Nw, Ls+R Z Ney Fog N S 
K, 


Fig. 1. The model of LAMOST mount drive servo system 


3 The Model of Friction 


The Stribeck friction model which is shown in Figure 2 is used in this paper. 


When | O(t) Ik a, the static friction is 


Le F(t)> Fi, 
F( =F) —-F,,<F<F, (2) 


_ Fi, F, < —Fiy, 
When | @(f) > @, the dynamic friction is 
E =| F ACF, — Fe 1" |sgn((t) +k 8 (3) 


where, F, represents the driving force; F,, represents the maximum friction; F, 
represents the coulomb friction; Kyrepresents the ratio coefficient of viscous friction 


torque; O(t) represents the rotation velocity; a, a, represent the small and positive 
constants. 
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Fig. 2. Friction-Speed steady state curve(Stribeck curve) 


4 Design of the General Sliding Mode Controller 


4.1 Design of Controller 


@ represents the position signal of the azimuth axis. The drive motor which used in the 
simulation model is the permanent magnet DC torque motor manufactured by Precision 
Motor Factory of Chengdu. To ignore the armature inductance, the position state equ- 
ation of the drive control servo system can be described as follows: 


0 1 0 0 

xi(t) X,(t) 

4 = KK if K, ju@-| 1 jF-@ (4) 
0 a eé t x, (t) pea ew os 

X2(t) JR JNR JN 


Where, x,(t) = O(t) represents the rotation angle; x, (t) = @(t) represents the rotation 
velocity. 


To design the switching function as s=ce+e and adopt the exponential reaching 


law as s=—€sgn(s)—ks, where € >0,k >0, the follows can be got: 


: ean Ra K, 7 Fry 


s=cet+tr-—( (5) 
JR JINR JN 
Follows can be got based on the above fomula: 
JNR,/* * K,K,: F, 
- + rte + ks +——* x4 6 
u 7 (ce+r+eésgn(s)+ks 7 x TN? (6) 
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4.2 Analysis of Algorithm Convergence 


To adopt the Lyapunov algorithm convergence and take the Lyapunov function as 
follows: 


V(x) = = (7) 


Follows can be got according to s=ce+e and s =—€sgn(s)—ks, we can get: 


V(x) =ss=(cet+ Bike sgn(s)—ks]=—(ce+ aie sgn(s)+ ks] (8) 


Where, €>0,k>0 , and V(x)<0 when s<O , sgn(s)<O ; when 
s>0,sgn(s) >O and V(x)<0. The system is asymptotically stable according to 


Lyapunov stability of the algorithm, and ss <0 is permanent establishment, and then 
the reaching condition of sliding mode is satisfied. System dynamics will arrive at 
the sliding surface within a limited time and stay on it, so the controller designed is 
feasible. 


5 Design of Fuzzy Sliding Mode Controller 


Fuzzy sliding mode controller is usually composed of the equivalent control u,, and the 
switching control u,,,, and the form of the control volume is u=Ugq Us. To adopt a 
two-dimensional fuzzy controller, and suppose the fuzzy controller input is S and ds. 
The switching sliding mode control u,,, can be obtained according to the output of fuzzy 
controller. 


(1)Fuzzy 
Suppose the domain of S, ES and U is ft 3, —2, —1,0,1,2,3} sand the corresponding 
subset of fuzzy language is {NB, NM, NS, ZO, PS, PM, PB}. 


(2)Fuzzy rules and fuzzy reasoning 

To design the fuzzy control rules based on the existence and the reaching condition of 
sliding mode, which are shown in table 1. The fuzzy rule is: If S is A and ES is B, then 
Uis C. 


(3)De-fuzzy method 
Using the center of gravity method, defuzzification formula according to the center of 
gravity method is shown as follows: 


Deu) *U 


—2 A) 


U) =—S — (9) 
Deo) 


U 
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Table 1. Fuzzy rules 


S NB NM NS ZO PS PM PB 

ds 

PB ZO PS PM PB PB PB PB 

PM NS ZO PS PM PB PB PB 

PS NM NS ZO PS PM PB PB 

ZO NB NM NS ZO PS PM PB 
NS NB NB NM NS ZO PS PM 
NM NB NB NB NM NS ZO PS 

NB NB NB NB NB NM NS ZO 


6 Simulation 


To create a simulation model of the system in Matlab / Simulink, in which the position 
tracking signal is /0"sin(0./t), and the corresponding speed signal is /"cos(0.1t). The 
simulation results are shown in Fig.3, Fig.4 and Fig.5 
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Fig. 3. PID control tracking curve 
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Fig. 4. Sliding mode control tracking curve 


Fig.3 shows that, when adopting the PID control the position error is between 
+0.02" and -0.02", the speed error is between +0.3'7/s and -0.3'/s, and the speed signal 
chatters when crossing the zero. The simulation results show that the conventional PID 
control is ineffective on the low speed system which contains more nonlinear interfe- 
rence. Fig.4 shows that, the tracking accuracy of the position and the speed are more 
highly improved than conventional PID control when using the sliding mode control 
,and however, the chattering phenomenon is produced. Fig.5 shows that, both position 
tracking and speed tracking can achieve the system performance requirements, and the 
chattering in the general sliding mode control is weakened. Meanwhile the system 
tracking accuracy is improved significantly. 
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Fig. 5. Fuzzy sliding mode control tracking curve 


7 Conclusion 


The control of high-performance servo system is very difficult. This paper takes the 
LAMOST telescope mount drive system, which has the control difficulties of large 
inertia, low speed and multi-disturbance, as the research object. The conventional PID 
control is ineffective on the low speed system which contains more nonlinear interfe- 
rence. The simulation results show that the sliding mode control can achieve the per- 
formance requirements of the drive system, and can obtain a satisfactory low-speed 
tracking performance, and however chattering exists. Fuzzy sliding mode control, 
which designs the fuzzy rules based on reducing chattering, effectively reduces the 
chattering of sliding mode control, and improves low-speed tracking performance. 
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Abstract. Synthesizing the advantages and disadvantages of each kind of maxi- 
mum power point tracking (MPPT) control method, one improvement control 
method is proposed. This method uses the constant voltage method to find the 
approximate location of maximum power point rapidly in the earlier period, and 
uses hysteresis comparison method to increase the precision in the later period. 
Compared with the traditional MPPT control method, this method tracking speed 
is quicker, and the tracking precision is higher, and can realize the smooth start 
under each kind of changeful environment. 


Keywords: solar cell; MPPT; control method. 


1 Introduction 


Driven by increasing costs and decreasing reserves of fossil-fuels, as well as by global 
environmental concerns, renewable energy is becoming a significant fraction of the 
total energy generation. [1] The solar energy already obtains more and more attention 
of various countries as the most valuable renewable energy source in the world. But 
the external environment is changeful (for example temperature, sunshine). The solar 
cell needs maximum power point tracking to use solar energy by the maximum effi- 
ciency. At present, the methods of maximum power point tracking (MPPT) are many, 
such as the constant voltage method, the perturbation and observation method, the 
incremental conductance method, the fuzzy control method, hysteresis comparison 
method [1-5] and so on, but the different methods have the different advantages and 
disadvantages in the actual application. In order to use the merit of each method fully, 
one improvement starting characteristic MPPT method is proposed in the foundation 
of the analysis to several common MPPT methods.The tracking speed and the preci- 
sion could raise obviously by this method. 


2 The Equivalent Model and Characteristic 
2.1 Solar Cell Equivalent Model 


The equivalent circuit model of solar cell unit is shown in Fig. 1. And, J, is the 


ph 
photoproduction electric current,which is in direct proportion to the area and the 
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incident light intensity of solar cell. 7, is the characteristic that the solar cell unit is 
similar to the ordinary diode in the non-illumination's situation because of the internal 


semiconductor PN ties. R, is the series resistance. R,, is the shunt resistance. Gener- 


ally R, is very small, but R,, is very big. 


tion i VY Rsh lr 


Fig. 1. Solar cell equivalent model 


The relations of the solar cell's operating current and the voltage are as follows: 


R U 
I = —“__[] , - — - 1] (1) 
R,+R,, : Ry, 


Where 
qu »y 
i= I, (e ee —1) (2) 


With combination (1) and (2), obtain 


Unf UHR, 
I=1,,-Ihle fae ala (3) 
Ry, 
I, ------- diode saturation current, [ oe Photoproduction electric current, [ ------- 
solar cell operating current, q------- electron charge 1.6X107'"°C ,U ae solar cell 
operating voltage, K------- Boltzmann constant 1.3810 J / K ,T------- absolute 
temperature A------- diode characteristic factor 


In ideal situation, R,=0, R,, >, A =1. 


qU rf 
= KT 
I=1,.-I,le —1] (4) 
When I=0, it can obain the open-circuit voltage of solar cell. 


kT. I 
Vi. =—In(—* +) (5) 
qd I 


oc 
0 
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2.2 Solar Cell Characteristic 


The open-circuit voltage and the electric current of solar cell change with the strength 
of illumination and the cell array temperature. Under certain strength of illumination, 
the I-V characteristic curve and the P-V curve of solar cell are shown in Fig. 2 and 
Fig. 3. 


and zzz eee eee | 
i) 4 4 6 8 10 12 14 16 18 20 ee 
9) Vm Voc a vm 


Fig. 2. I-V characteristic curve of solar cell Fig. 3. P-V characteristic curve of solar cell 


The position marked in the figure is the maximum power point. From the figure, it 
shows that solar cell has the obvious misalignment. 

Fig. 4 and Fig. 5 are the I-V and P-V characteristic curve of solar cell under the 
same temperature and different sunshine (S) separately. From the figure, it shows that 
the output power of solar cell array increases with the increase of luminous intensi- 
ty(S1<S2<S3). 
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Fig. 4. I-V characteristic curve under different Fig. 5. P-V characteristic curve under differ- 
sunshine ent sunshine 


Fig. 6 and Fig. 7 are the I-V and P-V characteristic curve of solar cell under the 
same sunshine and different temperature (T) separately. From the figure, it shows that 
the output power of solar cell array decreases with the increase of temperature 
(T1<T2<T3). 
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Fig. 6. I-V characteristic curve under different Fig. 7. P-V characteristic curve under differ- 
temperature ent temperature 


3 The MPPT Algorithm Comparison 


3.1 The Perturbation and Observation Method 


The perturbation and observation method (also called mountain climbing method) is 
one kind of control algorithm which is used most widely at present. Generally it con- 
trols the output voltage of the solar cell board through the control of the DC-DC 


switch's dutyfactor. First record the first time dutyfactor D, , and write down the out- 
put voltage V, and the electric current /, of the solar cell at this time, and calculate the 
powe P.. Next, let D, = D, + AD and measure the voltage V, and the electric cur- 
rent [ , at the second time, then calculate the power P,. D is decided to increase or 
reduce through the comparison of ‘yi and P, , accordingly, the maximum power point 


is found. Selecting the different size AD, the tracking speed and the precision are 
different. When ADis bigger, the tracking speed is quicker and the precision is lower. 
On the contrary, When AD is smaller, the tracking speed is slower and the precision 
is higher. 

The perturbation and observation method has the merit of simple principle, quick 
tracking speed and high tracking accuracy, however, it also has the inherent short- 
coming. It has the oscillatory occurrences and create the nonessential power loss for 
this nearby the maximum power point. When the external environment changes 
fiercely, it has the possibility to produce voltage collapse phenomenon [6]. 


3.2 The Incremental Conductance Method 


The incremental conductance method decides the direction of the perturbation mainly 


through the judgment of positive and negative of , and it is actually one kind of 


distortion of the perturbation and observation method. From the PV curve of solar cell 
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dP 
board in Figure 3, it shows that: When —— 50, the curve is at the left side of the max- 
imum power point, and needs to increase the dutyfactor to increase the output voltage; 


When —— <0, the curve is at the right side of the maximum power point, and needs to 


dP 
decrease the dutyfactor to decrease the output voltage; When —— =0,it is at the max- 


imum power point. 

The incremental conductance method eliminates oscillatory occurrences of the per- 
turbation and observation method completely, but it still has the voltage collapse 
phenomenon. Because of its complex algorithm, high measuring accuracy request and 
big cost, it is very difficult to be used widely. 


3.3. The Constant Voltage Method 


The principle of the constant voltage method is that the voltage of solar cell has the 
approximate proportional relationship with the open-circuit voltage at the maximum 
power point. Moreover this scale factor is invariant nearly when the sunshine and 
temperature outside changes. The working voltage can be adjusted through the meas- 
ure of the open-circuit voltage of the battery board. Thus the maximum power point 
could be found. 


Vuax = M, XV, (6) 


thereinto, M,, is scale factor. 


This control method is simple and the facility cost is low, however, the tracking 
accuracy is low and the error is big. 


3.4 The Constant Electric Current Method 


The principle of the constant electric current method is that the operating current of 
the solar cell has the approximate proportional relationship with the short-circuit cur- 
rent at the maximum power point.This scale-up factor,the same as the constant vol- 
tage method, is invariant nearly when the sunshine and temperature outside changes. 
The operating current can be adjusted through the measure of the short-circuit current 
of the battery board. Thus the maximum power point could be found. 


Iuax =M_XI,, (7) 


thereinto, , is scale factor. 


The tracking accuracy of this kind of control method is low. Because it needs to let 
the solar cell board short circuit for measuring the short-circuit current, there is great 
affects to the life of the cell board. It holds the inferiority compared with the constant 
voltage method. 
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3.5 Hysteresis Comparison Method 


Hysteresis comparison method is proposed to overcome the oscillatory occurrences 
and the voltage collapse phenomenon of the perturbation and observation method at 
the maximum power point. It does not make any movement when the external envi- 
ronment changes fiercely, and only carries on the tracking when the environment is 
stable. The concrete principle is as follows: 


c B 4A 
¥ (\ ‘ { N 
A 
A c A c 
c 


Fig. 8. Each kind of situation nearby the maximum power point 


It needs to measure the duty factor D, at a time , the voltage V, and the electric cur- 
rent J, nearby the maximum power point and calculate the power F,; Next let 
D,=D,—AD and measure V,, 1, at this time and calculate the power P ; Next 


let D, =D, + AD and measure V, , J, and calculate the power P. . Last set a com- 
parison mark Tag. 

If P. >= F,, Tagl=1; or Tagl=-1;If P, > P, , Tag2=1; or Tag2=-1; 

calculate Tag=Tag1+Tag2; 


If Tag=2, let D, = D., and make the operating point move to C; 


If Tag=2, let D, = D.,, and make the operating point move to A; 


If Tag=0, The operating point is motionless and the maximum power point is 
found. When external environment changes fiercely, Tag=0, it still regards it as the 
maximum power point and does not make any movement. 


4 MPPT Improvement 


It is known that hysteresis comparison method is able to eliminate the oscillatory 
occurrences and the voltage collapse phenomenon from the analysis of several kind of 
control algorithm, but it requests the three spots A, B, and C are nearby the maximum 
power point. The structure of the constant voltage method is simple, the cost is low 
and could find the maximum power point fast, but the tracking accuracy is insuffi- 
cient. One better algorithm is synthesizing the merit of these two methods. In the 
earlier period it uses the constant voltage method to find the maximum power point 
rapidly and in the later period uses hysteresis comparison method to enhance control 
precision. Such system has the most superior starting characteristic; the toggle speed 
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is quick and steady; the power increases monotonously,; it will not emerge the fluctu- 
ation phenomenon. 

In order to adapt the changement of the external environment fast, the timer setting 
function is used to skip to the constant voltage method periodically to find the maxi- 
mum power point. The flow diagram is as Fig. 9. 
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Fig. 9. Flow diagram of improvement control algorithm 


5 Conclusion 


This article carrys on the research to the maximum power point tracking of the photo- 
voltaic system using the improvement control algorithm. The improvement control 
algorithm could find the maximum power point fast when the external environment 
changes fiercely. It has the merit of simple control policy, smooth startup process, 
quick tracking speed and high tracking accuracy. It eliminates oscillatory occurrences 
and the voltage collapse phenomenon of the perturbation and observation method 
completely, and has the great superiority compared with other algorithms. 
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Abstract. For enhances the control method efficiency in the quality control, 
the causes the production process to be in the control area as far as possible, en- 
hances product the qualified rate. This article proposed quality control method 
based on data mining, and proposed the main body cooperates the data mining, 
introduced data mining's theory, the excavation process and the excavation me- 
thod, finally through the experiment, confirmed compared based on the data 
mining quality control method with traditional the statistical quality control to 
have the archery target enhancement, to a great extent raised the efficiency 
which the enterprise produced. 


Keywords: data mining, quality control, statistical quality control, main body 
assists, experiment analyzes. 


1 Introduction 


The quality control is to achieve the work technology which the quality requirement 
adopts and the activity. That is, the quality control is for through the surveillance 
quality forming process, eliminates on the quality link all stages to cause unqualified 
or not the satisfactory effect factor, achieves the quality requirement, the gain eco- 
nomic efficiency. Along with data warehouse and data mining technology starting, the 
people use from the massive quality historical data withdraw the knowledge to be able 
the better control online production, compared the based on data mining's method and 
original use the statistical quality control to have the archery target distinction, to a 
great extent raised the efficiency which the enterprise produced. The conventional 
routes already could not adapt under the new situation quality control, becomes the 
restriction enterprise development the bottleneck, therefore under the new manufac- 
ture pattern inquired about that the new quality control method, and realizes the high- 
er efficiency quality control to have the very vital significance. 


2 Data Mining Theory 


2.1 Data Mining Concept 
The data mining [1] is from massive, incomplete, has the noise fuzzily, in the stochas- 


tic practical application data, extraction concealment, the people do not know 
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beforehand, but is also the latent useful information and the knowledge process. Said 
simply is withdraws or “the excavation” from the mass data the knowledge. 


2.2 Data Mining Knowledge's Process 


The data mining process generally needs to experience; Data preparation, data min- 
ing, result expression and explanation. Data mining primary mission [2] May divide 
into two kinds: Description and forecast. The descriptive data mining portray data's 
general characteristic, the forecasting data mining is carries on the inference in the 
current mission, carries on the predict that like Figure 1 has demonstrated several kind 
of basic data mining duty: 


DBuscripiive 


Predictive 


© Chesification Repression (Description at) ( Cortelarion ) Cluster 
and Prediction thy cumeept at Aunlysis Aumalyxis 
elnss. \ 


Fig. 1. The duty of data mining 


Its use's main method has the classification and the predict and the return, con- 
cept/kind of description, the connection analysis, the cluster analysis. 


3 Quality Control Technology Research 


The quality control has experienced three stages approximately; first, the performance 
test stage, second, statistical quality control stage, third, Total Quality Management 
stage, corresponds to the above three quality control stage, the quality control tech- 
nology also develops from the most universal statistical quality control to based on 
the intelligence information processing control method, The quality control most 
main goal is enhances the control method the efficiency, the objective is enhances the 
control method the efficiency, causes the production process to be in as far as possible 
the control area, enhances product the qualified rate. What this article main research is 
the control technology, but non-quality control entire flow. At present mainly concen- 
trates to its technology's research in two kinds of technologies. 


3.1 Statistical Quality Control Technology 


Statistical quality control (SQC) is the present application scope broadest quality 
control method; its research is also mature. It uses the product the qualitative index 
to follow normal distribution this kind of rule to carry on the control. The control 
method is uses the control chart [3] general control chart principle as shown in 
Figure 2. 
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Fig. 2. The control of the figure 


The statistical quality control engineering research's primary coverage counts the 
technology, how to raise the statistical rate of accuracy to draw up the closer actual 
production process the control chart, uses the main method is the mathematical statis- 
tic method. 


3.2. Quality Control Technology Based on Data Mining 


The data mining is refers to from the data discovers usefully, hideaway pattern 
process. Advance which produces along with the enterprise, accumulates the massive 
historical data, based on data mining technology knowledge discovery characteristic, 
the people then apply this kind of technology in the quality control, unearths from the 
historical data has the value quality control information to use in the following pro- 
duction control, this is based on the data mining quality control technology. This kind 
of technology's core is the data mining technology. Has the essential difference based 
on data mining's quality control technology and the statistical quality control, it does 
not need to establish the control rule beforehand, the control rule withdraws directly 
from the quality historical data, avoided establishing not the flexibility beforehand. 
The data mining is one kind of relatively mature technology, at present the people 
mainly concentrate to this kind of controlling force method research in the data min- 
ing in the quality control application. In view of controlled member's characteristic, 
selects the appropriate excavation method, realizes the concrete quality control. 


4 Data Mining Method Based on the Main Body Assists 


The data isomerism which needs the domain expert in view of the tradition data min- 
ing to participate under the limitation which as well as the new manufacture pattern 
possibly exists the difficulty which brings for quality control, this article introduces 
the main body to solve this kind of problem. 


4.1 Definition of Main Body 


To solve in the knowledge integration the applied technology multiplicity, questions 
and so on unity and standard, the main body development has brought the hope for 
the question solution. Main body’s called the entity; it carries on the decomposition 
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to objective world's thing, discovers its basic constituent, then research objective 
things abstract essence. In recent years, the main body concept by more and more 
applications in the computer knowledge | two regulation domain, uses to the objec- 
tive world existence reality carries on the systematization to describe, convenient 
knowledge entrusting with heavy responsibility with alternately. 


4.2 Main Bodies Assist Data Mining 


The main body may assist the data mining from two aspects advance; first, based on 
excavation method main body [4]; second, based on excavation object main body [5]. 
based on main body data mining process as shown in Figure 3. 
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Fig. 3. Main bodies assist data mining process 


First, the data mining worker through obtains the data with the user exchange, the 
ultimate objective must unearth as well as anything is the related information which 
the user is most urgently needed knew. Second step, the restraint which conceals 
according to the user input, the main body which the data characteristic and already 
existed obtains all effective DM process set (each effective DM process is may im- 
plement carries out plan). in this in step including How to choose the suitable data 
pretreatment, how to choose the appropriate data mining algorithm and to carry on 
optimized, the visualization model operation to the excavation result. Then, the de- 
mand order which is urgently needed according to the user obtains to the second step 
institute carries out the plan set to carry on the arrangement, forms one to be possible 
to carry out the plan detailed list, Thus, the user may choose appropriately in this 
plan detailed list carries out the plan. Last step, chooses in the detailed list the plan 
and carries out. 


5 Data Mining in Quality Control of Assistance Based on 
Ontology 


Production activities, quality control is a very important part, only by improving the 
efficiency of quality control of products, the efficiency of enterprises in order to fun- 
damentally improve. Now widely used for enterprise statistical quality control (SQC) 
methods of abuse, combined with most of the current quality control based on data 
mining methods are still used after the extraction efficiency of control lag v. disease, 
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presents a more efficient quality control. And the general production process for the 
establishment of a quality control model. 


5.1 Quality Modeling 


Quality modeling to determine the substance of both control parameters, in general, 
the general production process of products is divided into three parts, one is the input 
part, that is, the production line of raw materials, quality control of the middle part, 
the last part of the output of the product, it is shown in Figure 4, the quality of the 
purpose of modeling is to determine how to set determine the QCM, but also how to 
set control parameters only makes the end product is qualified. This model is based on 
the quality of data mining, basic idea is to first select the object modeling of historical 
data, after preprocessing, and then extract the information contained and quality of 
knowledge related to the formation of control rules. 


Quality contral 
model 


ESSE Production Prod 
aw materials iin reducts 


Fig. 4. The quality of modeling diagrams 


5.2 Turning the Quality of Flow Based on Data Mining Thread 


Quality modeling method is using the first quality control of turning thread model, 
and through the model test. CNC lathe with CJK0635A object, CJKO635A turning 
screw is used in quality control of statistical quality control method, the parameters of 
the distribution of turning thread statistics, the parameters input to the Department in 
the production of CNC machine tools. 

It is using the quality modeling method, the union general turning thread's produc- 
tion line, carries on the knowledge excavation based on data mining's way to the turn- 
ing historical data, is as follows based on this kind of control thought's control chart: 


CRO lnihe 7 Cutting, Tatorteal 
be! parameters database 
» a 


Adjust tht parameters : 


Data Mining Madel 


Soccceeenesee: rad 
Tanwction ¥ 
control made 


Fig. 5. It is turning thread quality control system based on data mining 
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It uses the above control method, glass's processing parameter each time stores the 
historical database, applies the rough collection in the historical database's foundation 
to carry on the extraction to the control rule, The controlled variable feedback which 
obtains for lathe's control system carries on the reset to achieve again uses the new 
parameter to carry on the control the function. 


6 Experiment Analyses 


6.1 Data Selections 


Union this article proposed the theory method, the tentative data elects, from to cor- 
responds in the CJKO635A numerical control lathe's quality database data. The expe- 
riment take Microsoft Windows the Server2003 operating system as the platform, the 
database picks Microsoft Sqlserver 2000, the main body edition tool selects 
Prot6g63.1.1. The data inducts selects tool DTS, first establishes the ODBC data dri- 
ven for the data file, the data object is the DBC database file, but in system, so long as 
has installed the VFP software, has Visual FoxPro Tables the ODBC data pool, found 
windows the data pool supervisor, disposed the table of contents which the Visual 
FoxPro Tables table of contents was at to the data file. 


6.2 Data Mining Using Rough Sets 


In accordance with the method of rough set reduction of two-dimensional table, deci- 
sion rules expressed using the IF-THEN expression, the second column of Table 2. 
Training results in Table 1, the third and fourth columns. 


Table 1. Decision Rules 


No. _ Rules Detection record Percentage 
1 IF(FI=A16061)AND(F3=.2)THEN(D=1) 102 16 
2 IF(FI=A1 6061)AND(FS=.017)THEN(D=1) 91 15 
3 IF(FI=AI7075)AND(FS=600)THEN(D=1 125 20 
4 IF(FI=Al 7075)AND(FS=.005)THEN(D=1l) = 75 12 
5 IF(Fl=Fe 7 17 
4140) AND(F2=1200)AND(F8=300) 
THEN(D=O) 
6 IF(FI=Al 6061)AND(F8=600)THEN(D=O) 8 20 


6.3 Assessment 


Select from the historical data 333 (19 a defective) of data as test data, the results 
shown in Table 2. It can be seen. And objective decision making model the pass rate 
has been very similar, reflecting the good quality control results. 
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Table 2. Models the Classification Capability of the Test Record 


Test set Test Project 
333 data (314 Decision Rules 1 2 3 4 5 
qualified, 19 
defects) ae to feature F1,F3 F1,F5 F1,F8 F1,F5 F1,F2, F8 
se 
Percentage 16% 15% 20% 12% 17% 
Pass rate model 94.9% 100% 70.1% 88.9% 94.6% 
Pass rate 95.4% 100% 11% 89.3% 95.3% 


This paper selects the control and the present model of statistical comparison of 
control results in Table 3.The model established by this qualified products and subs- 
tandard products on the classification accuracy rate is significantly higher than in 
SQC, and the fluctuation range of smaller than the SQC. 


Table 3. Quality Control Methods and Compare the Effect of SQC-based Data Mining 


Control Test Training Features Accuracy Error rate 
data data re-centralization 

SQC 8/3333 no no 82.1244.4777 17.88+4.47 

This 3/(667+333) yes no 95.05+0.8333 4.96+0.83 

model 


7 Summaries 


In this paper, quality control, the introduction of the data mining technology, and 
proposed data mining based on ontology, describes the process of its implementation, 
Validated by experiment based on data mining for the quality control compared to the 
original use of statistical quality control has a qualitative difference, it has greatly 
improved production efficiency. 
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Abstract. Two new switched-current (SI) memory cells suitable for high-speed 
digital communication circuits are introduced. A kind of novel BiNMOS tech- 
nology is introduced in the first switched-current memory cell, and a high-speed 
fully differential BiCMOS structure is introduced in the second switched- 
current memory cell. Besides, measures of deducing the delay time formulae, 
optimum seeking the Parameters, and reducing power consumption have been 
carried out. All the results of hand calculations, SPICE simulations and mea- 
surement indicate that with a bipolar supply of 2.0 V~4.0 V, the proposed cells 
achieve a good comprehensive properties index-delay-power product (DP). The 
DP of the proposed cells achieves 12.9 pJ, which is about 16.5 pJ lower than the 
DP of CMOS SI memory cell AD585. Applications of the two new cells in a 
high-speed digital communication circuits are demonstrated. 


Keywords: switched-current memory cells; BiCMOS technology; delay-power 
product; high-speed; digital communication circuits. 


1 Introduction 


Switched current (SI) memory cells (known as current copier cells [1]) initially con- 
ceived to overcome the inherent matching limitations of continuous time current mir- 
rors [2, 3] are the simplest analog memory building blocks that can be realized in 
purely digital CMOS technologies. They became basic building blocks for many use- 
ful circuits such as dynamic current mirrors, accurate current dividers, A/D and D/A 
converters [4, 5]. So there is considerable interest in the research of SI memory cells. 
A low-power low-mismatch low-glitch class AB first-generation switched-current 
memory cell is presented in [6], and a low-voltage, low-power, low switching error, 
class-AB SI memory cell is proposed in [7]. The circuit decomposes the input signal 
into two components by a low-voltage class-AB current splitter and subsequently 
processes the individual signals by two low switching error class-A memory cells. But 
all the designs above adopt only CMOS technology. The CMOS devices always keep 
on alternatively under the static state. So the drive current only exists in the switching 
of movements. Thus, the switching speed of the current is very low. In order to meet 
the demand of high switching speed in information systems of digital communication, 
two kinds of novel high-switching speed SI memory cells are proposed. Combine the 
advantages of BJT with CMOS, a BiCMOS technology to improve the switching 
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speed and power consumption is adopted, because the switching speed of BJT devices 
is much higher than the CMOS devices but the supply voltage and the power con- 
sumption are very larger than the CMOS devices. 


2 Designs of BINMOS/BiCMOS SI Memory Cells 


2.1 BiNMOS SI Memory Cells 


The complete circuit schematic of 
the proposed BiNMOS SI memo- 
ry cell is shown in Fig.1. In which 
VT, and VT, are switching devic- 
es, VN3, VN, and VN;, VNe form 


Mos | aay: two CMOS differential pairs re- 
YR 7" os spectively. If the circuit is in the 
ai sampling mode, VT,;, VN, and 
VN, will turn on, and the current 
i, will charge the CG through VT). 
This time the grid-to-source vol- 
tage Ugs of VNg rise. When value 
of ugs beyond the threshold voltage of VNo, VNo turns on. This time the output cur- 
rent ig flows across VT7 and VNo, and ig~K;,I (where K is the proportional coefficient, 
and 1.4<K<2.5). Thus i; is slightly amplified. If the circuit is in the holding mode, 
VT», VN3 and VN; will turn on, and the potential of VT,’s base ug, is lower than the 
value of ugs+ ugg. SO Up; is lower than the potential of the emitter. Then the VT; 
turns off and holds on the io. 


Fig. 1. BINMOS switched-current memory cells 


2.2. Differential BICMOS SI Memory Cells 


The complete circuit schematic of 
the proposed BiCMOS SI memory 
cell is shown in Fig.2. In which the 
differential pair is powered by the 
current source Jp, and Jp.. As the 
differential pair has a fast switching 
speed and the currents can be added 
up easily, so the dependence of the 
differential pair’s output current on 
input current is weakened, and the 
feedthrough of the holding mode 
Fig. 2. Differential BiCMOS switched-current ecreases. When the circuit is in 
memory tracking state, ugg of all the BJTs are 

equal, and the collector current of all 
the BJTs are also equal. In Fig.2 the differential currents i,, and i. are the functions of 
input voltage u; and u;' respectively. When the circuit is in the holding mode, the drain 
currents of VN,' and VN, are added up, and the drain currents of VN,' and VN, are 


{i =)g-+(PAD = 


as Vop 


ij- =hp- - U- ADS 


. sky 
ay 


io- 
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added up. Then i;, and i, are independent of uw; and u', while i9,~Kij,, ip _~Kiy. 
(where K is the proportional coefficient, and 1.4<K<2.5). 


3 Measures for Speed Increasing and Voltage Decreasing 


3.1 The Deduction of the Time Delay Formula 


The formula of average time delay tPD is deduced by analysing the circuit schematic 
of BiCMOS SI memory cell in Fig.1. After being analysed, the formula (1) and (2) 
can be written to express the time delay tPLH and tPHL in the pull-up and pull-down 
process of SI memory cell respectively: 


tpru~Cp Upe/Ipiat ae [Cp Co/( Cp + Co)] Vpp/(2Ipie)- (1) 


teat=CpUget/Ipu =F [CgCoM(Cp + Co)] Vpp/(2Ipuzz) - (2) 


Where Cg and Ugg) are average stray capacitance of the base-to-ground and voltage 
drop of the base-to-emitter in WT, respectively, Cg and Cp are average value of grid 
capacitance and drain capacitance of NMOS devices respectively. Vpp is the power 
supply. Jppy; and Jppy2 are average charging currents in the pull-up process. Jpy,, and 
Tpy_2 are average discharging currents in the pull-down process. It can be seen from the 
formula (1) and (2) that the speed of the SI memory cells is determined by charging 
and discharging of the electrode capacitance in pull-up and pull-down processes. Be- 
cause the connection mode of BiCMOS circuit’s electrode capacitor in Fig.2 is similar 
to the connection mode of BiNMOS circuit’s electrode capacitor in Fig.1, the delay 
time formula deduced from the Fig.2 is similar to the formula (1) and (2). 

Besides, it can be seen from the formula (1) and (2) that Vpp is a part of the numera- 
tor in the second term, so the lower Vpp can make the time delay smaller. 


3.2. Contractive Analysis in the Time Delay of the Two Memory Cells 


The SI memory cell in Fig.1 is also the object of research so as to simplify the analy- 
sis. When the switch devices VT, and VNg are in the turn-off state, then the current 
conducting channel will does not exist and the total equivalent capacitance will be 
determined by Cg. When the VT, and VNsg are in the transition state, the inversion 
layer formed in VNg act as the channel between drain and source. So the substrate is 
shielded by the channel at the terminal of grid, and Cg=0. When the VT, and VNg are 
in the turn-on state, the channel of VNo is pinched off and the capacitance between 
grid and source is almost zero. The capacitance Cg between grid and substrate is also 
about zero. Because the value of Cg in the three states above is about zero and Cg is 
in series with Cg, the equivalent capacitance is also about zero and the formula can be 
simplified as follows: 


toty=CpUpei/Ipun- (3) 


tput~CpU; pet/ pu. (4) 
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So the time delay fpyy and fpy, in the pull-up and pull-down process are determined by 
Cp (because of Ugg; =0.6 V). Compared with the CMOS SI memory source AD585 
(in sample-hold mode), in AD585, the circuit where Cg is in series with Cg does not 
exist, but there are several Cg connected in parallel. On the contrary, a topology of the 
circuit in which Cg is in series with Cg is included in the proposed SI memory cells. So 
the time delay of proposed SI memory cells is very small according to the estimating 
formula above. Then the estimating formula of the average time delay fpp is that: 


tpp= 


(tout tou)/2. (5) 


3.3 Optimizing of the Parameters of the Devices 


Table 1. Parameters of the BiCMOS SI memory cell According to the formula (3) 


and (4), the optimal parame- 


Technology/Transistors Parameters ters in BiCMOS SI memory 
Technology 0.35 um BiCMOS cells are shown in Table 1. 
Beak te Double well CMOS+NPN The purpose of optimizing 

poly-silicon emitter transistor the parameters of the devices 
BJT Ag=0.85 X 4.0 m2 is to guarantee the low time 
fr=15 GHz delay and small power con- 
ele sumption of the proposed 
Renae BiCMOS SI memory cells. 
Rp=53 0 
f=20~30 


MOS device 


0.42 um (NMOS) 


0.50 pm (PMOS ) 


Ury (according to VDD) 
Tox=100 nm 


3.4 Preparation of the Devices for Speed Improving 


BJT devices: The key of the 
emitter technology is to maintain 
the enough high current gain and 
reduce the RE;The extended 
pressure results in increasing the 
Re under the high forward cur- 
rent density, so the reliability of 
the emitter should be taken into 
account; The technology to re- 
duce the capacitance Ccg of J. 
include adopting double-well 
polycrystalline silicon emitter 
structure, reducing the junction 
area of extrinsic J, and reducing 
the doping concentration of the n- 
type S; below the extrinsic J,. 


t (ns) 


Fig. 3. The Input-output transfer characteristics of 
the tow kinds of BiCMOS switch SI memorie 
cells: BINMOS memory of Fig.1 (K=1.9, sample 
mode); Differential BICMOS memory of Fig.2 
(K=1.9, sample mode) 
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CMOS devices: Si surface silicified treatment technology is adopted to reduce the 
sheet resistance and enhance the ohm contact; In order to control the short-channel 
effect, an injection technology which adds the halo on nonuniform (shrinking) channel 
is adopted; The shallow trench isolation process with minimal space consumption is 
used to isolate the CMOS devices; Reducing Rps, Cp, tpty and tpy, at the same time is 
carried out according to the formula (3) and (4). 


4 Experimental Results and Discussions 


4.1 Date of the Simulation and Hardware Experiments 


Simulation and hardware circuit experiments for the proposed BiNMOS/BiCMOS SI 
memory cells are implemented respectively, and the hardware is implemented in Jiang- 
su electronics key laboratory. The tools for simulation and hardware circuit experi- 
ments are PSpice8.0 and Agilent electronic test instrument respectively. The simulation 
waveform which reflects the relationship between the input and output is shown in 
Fig.3. It can be seen easily from Fig.3 that the time delay of output signal relative to 
input signal is about 2-3 ns, and the waveform of output signal always follows the in- 
put signal without any distortion. Table 2 compares the proposed BINMOS/BiCMOS 
SI memory cells with previous CMOS SI memory cell (taking the AD585 as the repre- 
sentative.). The test condition is that all the SI memory cells work in a sample-hold 
mode. The result of hardware experiment shows that the both proposed IS memory 
cells can work in the power supply Vpp=3.1 V, the time delay tpp has reduced to 2.0 
ns-3.1 ns and the power consumption Pp has reduced to less than 4.7 mW. 


Table 2. Simulations and experiments results of CMOS/BiNMOS/BiCMOS memories (f=60 
MHz, VDD=3.1 V) 


Ay. power 


consumption Avy. propagation Delay-power 
The different delay product Drive 
: PD at 100 MHz 
experiments /mW tpp / ns DP/pJ current 
memories ———_______? oa mmm sig / MAA 
Sim. Exp. Sim. Exp. Sim. Exp. 

CMOS memory 
of AD585 1.1 1.2 26.7 27.3 29.4 32.8 0.1 
FOS 4.0 4.1 2.0 23 8.0 94 14 
memory of Fig.1 
Pe es 4.6 47 2.8 3.1 12.9 14.6 15 
memory of Fig.2 


Sim.: Simulation; Exp.: Experiment. 


5 Relationship among Delay-Power Product DP, Power Supply 
Vpp and Time Delay tpp 


From the analysis in [8] it is easily to notice that the delay-power product DP reflect 
the comprehensive performance of circuit. In order to compare the performance of the 
CMOS, BiNMOS and BiCMOS SI memory cells, the relation curves between DP and 
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Vpp of the three memory cells have been carried out. The result in Fig.5 shows that 
under the same condition of Vpp=2.0 V~4.0 V, the DPs of the BINMOS and BiCMOS 
SI memory cells are deeper than the CMOS SI memory converter. When the Vpp is 
about 3.04 V, DP of the three SI memory cells achieve minimum value. But when Vpp 
is less than 2.0 V or more than 4.0 V, the DP increases gradually, and the advantage 
goes down. It can be seen clearly that when 2.0 V<Vpp<4.0 V, DP of this BiN- 
MOS/BiCMOS SI memory cells are about 21.4 nJ and 16.5 nJ deeper than CMOS SI 
memory cell. All the results show that the designed SI memory cells have some advan- 
tages both in speed and in the comprehensive properties index DP. So they belong to 
high speed and high performance SI memory cells. 


6 Conclusion 


Two BiCMOS SI memory cells were introduced and analyzed. The advantages in high 
speed and low power supply of the BINMOS/BiCMOS SI memory cells are shown in 
Pspice 8.0 and hardware experiments. The reasons are that: The parametric optimiza- 
tions of the components have been worked out; the complementary advantage between 
single-polar and bipolar devices is worked out through adopting the BiCMOS technol- 
ogy. The experiments show that the proposed SI memory cells can work under the 
power supply of 2.0 V<Vpp<4.0 V, the time delay tPD of it is 24.2 ns smaller than sim- 
ilar CMOS SI memory cell AD585, and the power consumption PD is only 3.5 mW 
larger than that of AD585. So the DP of the proposed SI memory cells is about half of 
AD585’s and the drive current come from the output of it is about 14 times larger than 
that of AD585. 
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Abstract. This paper presents a novel high-linearity and low-consumption 
BiCMOS frequency-to-voltage (F/V) converter which is composed of three 
operational amplifiers (OAs) Al, A2 and A3, in which Al and A2 are 
common-source CMOS OAs, but A3 is a low-pass filter (LPF) which adopts 
the BiCMOS technology. The component parameters of the entire converter 
are optimized and some measures such as raising speed and reducing 
power-consumption are taken. Experimental results show that the pulse signal 
frequency f2 input into the LPF of the converter is equal to the input signal 
frequency fi, and the output average voltage (Uo) of the converter varies directly 
with fi. This converter is tunable from 4 Hz to 10 kHz, and its delay-power 
product (DP) is about 1.09 nJ, and its conversion linearity (LC) is only 1.7X 
10-2. With these characteristics, the presented converter is very suitable for 
low-frequency signal processing systems. 


Keywords: BiCMOS analog integrated circuits; frequency-to-voltage converter; 
high-performance; operational amplifiers. 


1 Introduction 


A F/V converter is a device that generates an output voltage proportional to the fre- 
quency of an input signal. It can convert a signal from alternating frequency to alter- 
nating voltage linearly. This device is very useful and has many applications in power 
system control, in processing of very-low-frequency signals, and in many fields of 
instrumentation. Because of the strong anti-jamming capability of frequency signal, it 
is also often used for long-distance transmission. It can be modulated in the RF signal 
for wireless transmission. Usually, the F/V converters are designed to comply with 
several requirements such as high speed operation, low-output ac ripples, good linearity 
and wide frequency range [1~3]. Besides, the application and the principle of a series of 
integrated F/V converter chip such as LM131, LM2907 and LM2917 were presented in 
[4~6]. Another new F/V converter which was used to generate coded signals for 
AC/DC servomechanisms was designed in [7]. Another kind of F/V converters can 
produce a voltage that is proportional to the frequency of a sinusoidal wave form input 
signal as proposed in [8]. Besides there are three kinds of high-performance F/V con- 
verters presented in [9~11]. In order to meet the performance requirements of intelli- 
gent multi-parameter high precision measurement and the high-performance fiber-optic 
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transmission system, a new F/V converter which has low-consumption and 
high-linearity based on BiCMOS technology is proposed, and the simulation and 
hardware circuit experiment are carried out. 


2 BiCMOS F/V Converter 


The complete circuit schematic 
of the BiCMOS F/V converter 
composed of A;, A» and A; is 
shown in Fig.l. The voltage 
comparator composed of Al 
convert input signals into 
square wave signal with the 
same frequency, and then 
transmit the narrow pulse with 
positive pulsatile capability to 
A) through the capacitor C; and 
the diode VD3. Because the 
potential wy in the inverting 
input is negative before trig- Fig. 1. BiCMOS F/V converter 

gering; the potential in the 

output of A, is positive; the VN,, VN; is turning on and the potential uw, is negative. 
When the voltage +Vpp in the non-inverting input trigger A and shift it to low level 
swiftly, VN, is turned-off and uz jumps to high level which is equal to the stable voltage 
Uz of the silicon stabilivolt VS. At the same time uy jumps to the high level Uy cor- 
respondingly; then the VN, is turned-off and +Vpp charges the capacitor C through the 
resistance R. So the voltage up in the non-inverting input of A, rises following the 
exponential law. After the time of Ty, when the voltage up increases to Uy, the voltage 
of A, reverse swiftly again to reset. Then the process of the monostability is over. Thus 
the drain voltage which is the voltage pulses with the width Ty and the amplitude Uz 
increases with the rise of the input frequency f,. According to the principle of super- 
position, when VN; turns off, the potential in the inverting input of A, is following: 


Uy=R,U72/(R, + R2)— Ro Vpp/(R, + R2). (1) 


When the charging interval is up to Tw, up(Tw)=Uy. Therefore, after calculation, the 
time charging to Ty of the RC circuit is reached: 


Tw=RC In{[ R/(R+ Ro) ]x[(Ri + Ro) Vp] (Ri + Ra) Vop— (Ri Uz— Ro Vop)]}- (2) 


In Fig.1, the low-pass filter (LPF) formed by A; is also can be considered as voltage 
follower. So the average voltage in the output of the whole converter is follows: 


U.=TwU zfs (3) 


The formula(3)shows that the U, of the converter is proportional to the input signal 
frequency jj, and the ratio of coefficient is TyUz. 
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3 Measures for Speeding Promoted, Power Consumption Reduced 
and Linearity Improved 


3.1 The Improvement of the Circuit 


In Fig.l VD, and VD, in the input of the F/V converter protect the circuit by limiting 
the voltage. The R; and R, accelerate the jumping of the voltage by inducing the posi- 
tive feedback in the comparator A;. Because of the backlash, the capacity of resisting 
disturbance of the comparator A, is stronger than other comparator. A3 can be consi- 
dered as the voltage follower. So the series voltage negative backward is induced to 
increase input resistance, reduce output resistance, broad band, restrain interference 
and noise. 


3.2 Optimizing for the Parameters of Elements 


The supply voltage +Vpp=+6 V; the accuracy class is +0.001; and the stabilized 
voltage of VS is 4 V. The silicon reference diode with lower temperature coefficient 
should be selected. The principles of selecting the parameter of RC-components are 
higher speed, higher linearity, and lower power consumption. After the selective pre- 
ference, the metal film resistor which belong to the E24 nominal value series with error 
+5% and tantalum electrolytic capacitor which have a mall leakage current are worked 
as the RC-components of the converter in Fig.l. The parameters of every 
RC-component are that R;x=R,z=10 kQ, Ro=R3=Rs=R7=Rg=Ro=R yQ=R=Ri2 
=1kQ, Re=0.1kQ, R2=75Q, Cy=C,=0.1 pF. 


3.3. Optimizing for the Preparation of the Semiconductor Devices 


CMOS devices: () Electrode is made by 
adopting an impure and self-aligned ion im- 
plantation. @) In order to reduce the sheet re- 
sistance and strengthen the point-pressure ohmic 
contact, siliconizing is using to the surface of the 
devices. (3) Non-uniform channel added with 
halo injection is adopted. @ Shallow trench 
isolation process with minimal space occupation 
is used to insulate the MOS devices. 


BJT: © Polysilicon emitter bipolar BJT pair 
transistor with ultra shallow junction is made. Fig. 2. Microphotograph of the BiC- 
The depth of E junction is scaled down. The MOS F/V converter 

width of the base and the base transit time of carriers are reduced. @) The key to the 
technology of emitter is that an adequate size of the current gain should be maintained 
and RE should be reduced as small as possible. @) Considering the reliability of the 
emitter contact, the contact area between the base and emitter is made small. 


Layout: The standard 0.18 um BiCMOS technology which provide six layers of metal, 
diversified BJT and MOS models, poly resistors and MIM capacitors is adopted in the 
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whole chip. Fig.2 is the microphotograph of the BiCMOS F/V converter. The voltage 
comparator composed by A, and the monostable trigger composed by A, are in the 
below two-line boxes of the layout. However the LPF composed by Aj is in the upward 
side of the layout. Excluding the weld spot, the whole circuit occupys an active area of 
160 pmx120 um. 


up (V) 


Fig. 3. Waveform of key points in the voltage test. (a) Input voltage of F/V Converter, 
(b) Output voltage of comparer A>, (c) Input voltage of LPF, (d) Voltage in the anti-phase input 
of A,, (e) Voltage in the in-phase input of A, 


A Novel High-Performance BiCMOS F/V Converter 529 


4 Discussions and Analysis of Experimental Results 


4.1 Voltage Waveform of Points in F/V Converter 


The proposed BiCMOS F/V converter has been implemented in a standard TSMC 0.18 
um BiCMOS technology. The tools for simulation and hardware circuit experiment are 
PSpice8.0 and Agilent electronic test instrument respectively. The digital oscilloscope 
is the Tektronix TDS5034B and the signal generator is E4438C made by Agilent 
Company. At the beginning of the experiment, alternating sinusoidal voltage 
ui=3sin(1000zt) V is input. Then the key points of the circuit are measured, and the 
waveforms are obtained. Fig. 3 shows that when the whole F/V converter input a cycle 
of sinusoidal signal, the monostable trigger A, will output a positive pulse with pulse 
width (7w=0.7 ms). This work moves in cycles. Then the drain of VN3, which is also 
the input of LPF, will obtain a sequence of positive voltage pulse u2. The number of the 
pulses in the pulse sequence is equal to the number of the circles of sinusoidal voltage 
u;. That means the input frequency f; of the pulse sequence pushed into the LPF 
equaling to the frequency fi of the input signal is 500 Hz. According to Fig.3(c) and 
Fig.3(e), it can bee seen easily that Uz~3.90 V, and Uy~3.02 V.These data are consis- 
tent with the analysis in the section 2, which shows that U2 in the circuit has a good 


sensitivity and accuracy. So like the Uj, the voltage uz also can be used to connect load. 


4.2 Relationship between Frequency and Voltage 


Fig.4 shows that the measured the aver- 
age U, of three output voltage versus the 
input signal frequency f, under the con- 
dition that the temperatures are 60°C, 27 
°C and SC respectively. To make it easier 
to read, f; in the Fig.4 is expressed by 
abscissa and the decibels of the voltage 
ratio U/Ug is expressed by ordinate 
under the condition that Up=1 V, which 
means the operation 201g(U,/Up) is made : : : 
on the ordinate. The measured three 0 10 10° 10° 10° 


201g (U,/Up) (dB) 


curves show that the converter can work f, (Az) 
normally in the frequency range from 4 
Hz to 10 kHz. U, following with f, grows Fig. 4. Relationship between U, and f, 


linearly. However, the three linear curves 

show that the converter can work nor- 

mally in a wide temperature range (5’C ~60°C) and eliminate the impact on the linearity 
of the converter from temperature variation. 


4.3. Date of the Hardware Circuit Experiment 


Tab.1 shows the experimental date from the three times testing of the BiCMOS F/V 
converter. So it is easy to see that though the arithmetic mean value of the time delay tpp 
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Table 1. Main date of this BiCMOS F/V converter 


No. of tests First Second = Third AV 

3 dB(kHz) =10 =11 =9 10 

Linearity Lyp(x10—) 21.8 =1.7 =1.6 1.7 
Pp (mW) 23.6 23.9 24.5 24.0 
tpp(ns) 45.3 45.1 45.7 245.4 


is 45.4 ns, the power consumption Pp is only 24.0 mW. Then the aggregative perfor- 
mance indicator we call it the arithmetic mean value of delay-power product (DP) is 
that: 


(23.6x45.3 + 23.9x45.1+24.5x45.7) +3=1.09 nJ. (4) 


The 3 dB bandwidth means the frequency is about 10 kHz. In order to see clearly, the 
curves reflecting the relationship between DP and supply voltage Vpp of the bipolar, 
CMOS F/V converter in [8, 9] 
and this BiCMOS F/V converter 
are plotted by experiment in 


5.0 


> This BiCMOS F/V converter 


Fig.5. The experimental result 4.0 as The BIT F/Y converter in [8] 
can be achieved only on the -o. The CMOS F/V converter in [9] 
condition that the voltage range 3.0 


is from 2 V~8 V. All the results 
show us that the designed 
BiCMOS F/V converter has 
advantages in the aggregative 
performance indicator DP. 


DP (PJ) 


5 Summary 


A kind of high-performance F/V 
converter is presented. Both 
simulation and hardware circuit 
experiment show that this con- 
verter has obvious advantages 
both in power consumption and in speed. The reason is that 2) BiCMOS technology is 
applied in this paper. The CMOS devices form the main part of the converter. The two 
BJTs are only used in output stage which is composed of LPF operational amplifier, 
and they turn on alternately as pull-up and pull-down devices respectively. @) The 
parametric optimization of the components have been worked out. The positive pulse 
sequence uy with the width Ty and magnitude Uz is generated by BiCMOS F/V con- 
verter, and the value of its output voltage Uz varies directly with the input signal fre- 
quency f;. The pulse sequence u2 can connect directly with the interface of computer. 
This way can take the place of the ADC and occupy less hardware resources. In addi- 
tion, there is a hysteresis loop in voltage comparator. Because of these reasons, the 
designed converter has a strong anti-interference ability. 


Fig. 5. Relationship between DP and Vpp of the 
three F/V converters 
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Abstract. A novel 0.15 um BiCMOS A/D optoelectronic converter (OC) was 
designed, which is composed of optoelectronic receive devices and a Schmitt 
A/D OC. To improve the gain linearity, accuracy and stability of this proposed 
A/D converter, negative feedback and bandgap reference were introduced in this 
circuit, respectively. Additionally, component parameters were optimized, which 
can enhance the performance of the whole system. The layout of operational 
amplifier (OA) was designed and its chip area was 0.42 mmx0.32 mm. The si- 
mulation and hardware circuit experiments were also given. Test results show 
that the photocurrent increases with exponentially regularity followed by in- 
creasing of the laser power. The converter can use 3.3V power supply, -3 dB 
bandwidth is about 58.4 kHz and the power consumption is about 69 mW, so the 
proposed converter can meet the requirement of low-frequency displacement 
optoelectronic control systems (OCS). 


Keywords: BiCMOS; Optoelectronic receive device; Bandgap reference; 
Schmitt A/D converter circuit. 


1 Introduction 


With the advantage of the fast arithmetic speed, high reliability and the powerful 
function for data processing, storage and control, PC is widely used in optoelectronic 
measurement and control system, especially in the industrial process control and pro- 
duction lines. As computer can only receive the digital signal "0" or "1", while the 
luminous intensity of the OCS reflects variation of the physical quantity (displacement, 
rotate speed and pressure etc.), and optoelectronic devices (ODs) convert physical 
quantity changes to analog signal, therefore, a interface circuit—A/D converter is 
needed between the ODs and the computer. For example, when producing armor plate, 
a steel company want to make the plate stack neatly, it uses photoelectric control 
system based on plate edge location. When the measured plate swung to the left or right 
edge, the output of photoelectric converter is "1", otherwise is "0". If the plate speci- 
fication is different, it also needs to adjust the displacement in the length direction. So 
this case can not only reflect the variation of the displacement and its direction, but also 
require the A/D conversion for photoelectric signal to control the stacked position of 
plate accurately. Obviously, the luminous intensity and stability of photoelectronic 
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devices directly affect the measurement error. Additionally, the bright-dark of the 
workshop brightness will also affect the detection of plate edge. So it requires to 
manufacture a high stability A/D OC. 

According to the published literature, it has not been reported about the design of 
high stability, low power A/D OC[1~6]. With the characteristics of BiCMOS tech- 
nology—low power consumption and high integration, a novel 0.15 um BiCMOS 
Schmitt trigger circuit A/D OC is present. 


2 Design of Schmitt A/D OC 


2.1 Laser Single-Channel Interference Measurement Systems 


In Fig. 1(a), firstly, He-Ne laser with frequency stabilization circuit emits beam, when 
the incidence beam reaches to the viewfinder M, (splitter), it is divided into two beams, 
one is the reference beam (S)), the other is the measurement beam (>). These two 
beams are reflected back to M;, through total reflection mirror M>, M3; respectively, and 
it produces interference beam ($3) in the location of Mj, then the interference fringes 
are received by ODs VD;, VDy, at last pulse analog signal is formed, so the displace- 
ment information of M; can be carried in this analog signal. Fig.1(a) shows the prin- 
ciple of laser interference measurement (LIM) device: Mp is fixed and M; is installed 
on the movable measuring head, according to the linear relationship between the 
movement amount of M; and the length of detected object, the length of object can be 
measured; The optical length of reference beam is not equal to the measurement beam, 
when the optical length difference 4 is integer multiple of the wavelength A, i.e., when 
A= +K/ (K=0, 1, 2,...), the two beams are in phase, now the light intensity is maxi- 
mum, when M, emits bright spots, ODs can receive this bright spot signal; when A= 
+(K+1/2)A, the phase difference of this two beams is 2, and the light intensity is zero, 
when M, emits dark spots, without light signal incidence, the output signal of optoe- 
lectronic receiver is zero; When the movable mirror M; moves along the optical axis 


VD, 


| <7 


interference light intensity 


right-shift 


| | VD VD, 


S3 two-way mobile interference fringes left-shift 
vas ZRN ODs 
(a) Principle of LIM device (b) Direction of interference fringes 


Fig. 1. Single-channel LIM systems 


Novel 0.15 tm BiCMOS A/D Optoelectronic Converter with Schmitt Trigger Circuit 535 


of measurement beam, There will be alternating bright-dark interference fringes in the 
location of M,, so the variation of luminous intensity Jy is 


Iy=Iov tov K,cos(2nA/A). (1) 


Toy is the average light intensity, K, is the interference fringe contrast. From above formula, 
when J changes A, the bright-dark of interference fringe will change a cycle. If interference 
fringe changes n cycles, then A =n xA, interference measurement device as Fig.1 shows, A is 
twice as long as the displacement L of M3, and the measured displacement: 


L=nN/2. (2) 


so if the change cycle of interference fringe is calculated, then the move length of 
measurement head will be known. Fig.1 (b) schematically shows the interference fringe 
direction. The bright-dark of interference fringes interchanges with the movement of 
Ms, it is equivalent that interference fringe is moving. So the output signal of VD), VD 
is approximately sine wave; while the two narrow gaps make phase difference of output 
sine wave of VD,, VD, reach 7/2. If the interference fringes moves to the right, then the 
phase of output sine wave of VD, is ahead of VD, 7/2; if interference fringes moves to 
the left, then the phase relationship is contrary. 


2.2 Schmitt A/D OC Circuit 


As is shown in Fig.2, VD; (VD2) is followed by A/D OC circuit, VD; (VD2) is con- 
nected to source output device which is composed of VT, Rp, Rg and Ro. The latter 
outputs analog voltage signal u; which is linearly related to the luminous intensity of 
ODs; uy is added to the inverting input of OA A. Bandgap reference composed of R;, Ro, 
R3, Qi, Q2 and A; outputs reference voltage Up, which is added to the non-inverting 
input of A through Ry; A, Ry, Rs, Re, R7 and two-way regulator VDz etc. constitute a 
Schmitt trigger; positive feedback was introduced through Ry, Rs, the purpose is to 
accelerate hopping rate of the output digital voltage uo, and it makes the voltage 
transmission characteristics have hysteresis, in order to improve the anti-jamming 
capability of the circuit; VDz can restrict the amplitude, it makes positive and negative 
amplitude of uo within +U7z, after derivation, the threshold voltage of this trigger is: 


Urn =(tR4Uz+RoUp)M( Ra tRo). (3) 


obviously, this trigger has two threshold voltage which are variable. 


Fig. 2. Variable threshold binarization processing circuit 
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Fig. 3. Internal circuit of the BICMOS OA A 


2.3. Key Design for BiCMOS OA 


Fig.3 gives internal circuit of the BiCMOS OA A. It is a differential BICMOS OA[7]. 
Source-coupled differential input stage was composed of VN, and VN,; com- 
mon-source voltage amplification was composed of VP3, and C is a vibration capa- 
citance, VT, and VT, are complementary push-pull driver stage. Feedback network is 
constituted of external resistance Rg and Rp, high input impedance and broadband 
properties are achieved through differential input stage using insulated gate MOS 
devices. Push-pull output not only reduces the output impedance, and increases 
bandwidth[8]. Voltage parallel negative feedback was introduced in Rp. All MOS 
devices in OA can use the selected BiCMOS parameter from References[9] in Table 1. 


3 Design for BiCMOS OA Chip 


OC circuit can be achieved by 0.18 um BiCMOS 
process using the standard of TSMC company. The 
preparation method is: OA A is made into a chip 
singly, or OA A and source-output device are made 
to co-produce a ASIC. For the source output devices 
fl is influenced great by ODs, the design for layout of 
OA A is finished firstly in the Cadence environment, 
then the parasitic RLC parameter extraction and 
post-simulation experiment is carried out, The results 
show that gain-bandwidth product of circuit declines 

Fig. 4. Design for layout of OA for influenced by parasitic parameter of circuit, so it 

; needs to re-adjust RC parameters and optimize 
layout-wiring. For example, the function block is placed in a reasonable position, so the 
connection line between each other can be the shortest; the output line is separated from 
feedback connection to reduce high-frequency interference between them, double-layer 
metal wiring is used and so on. The chip area of A was 0.42mmx0.32mm in Fig.4. 
Similarly, the layout of single chip ASIC can be designed and the size is a bit larger 
than OA A. 
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4 Simulation and Hardware Circuit Experiments 


4.1 Design for BiCMOS OA Chip 


i / (uA/cm?*) 
N 


0 a ae ae 
pi / (mW/cm*) 


Fig. 5. Curve relationship between current 
density i and incident optical power p; 
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(a) the input voltage uy; 


Fig.5 shows the curve relationship be- 
tween current density i and incident 
optical power p; from 1 mW/cm’ to 5 
mW/cm’, using 570 nm incident light 
wavelength. The change trajectory of 
photocurrent i of VD; and VD, is almost 
same from the Figure, They are in ac- 
cordance with exponential increase with 
the increase of p;, which shows that once 
VD, or VD, exceeds the threshold vol- 
tage, even if absorbing only a bit optical 
power, photocurrent will also increase 
sharply. 
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(b) the output voltage uo 


Fig. 6. Measured waveforms of the u; and uo 


4.2 Input and Output Waveforms of Schmitt A/D OC Circuit 


Firstly, the input and output voltage test is carried out for OC circuit using the above 
method, and the supply voltage +VDD is +3.3V in test, the external resistor of source 
output device and schmitt trigger are: Rg=Rp=R7=4 kQO, Rp=100 OQ, R5=Rp=2 kQ, 
R4=1.5 kQ, the RC parameter of OA are: C=10 pF, Rp=10 kQ; two sets circuit as 
Figure 2 shows are used to test the results, the quasi-sinusoidal signal frequency ex- 
ported from optoelectronic receiver takes 100 Hz, the reference voltage of OC: Ur=0 V, 
VDz is connected with two silicon voltage-regulators 2C W103, and the voltage is 
restricted at +2.6 V. MATLAB soft is used to get the waveform of input and output 
voltage, as is shown in Fig.6. It can be seen clearly that the simulation waveforms and 
theoretical results are consistent, and the output waveform of ODs is proved to be sine 
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wave; After OC, the final output voltage is rectangular wave pulse signal, therefore, the 
functions for optoelectronic signal A/D converter is achieved. Then, it begins to test 
frequency response feature, circuit and device parameters are ibid, selecting 26 
frequencies in the frequency range 0.1~100 kHz are measured, drawing out ampli- 
tude-frequency characteristic curve of the measured and simulation results by MAT- 
LAB soft, as Fig.7 shows. From the Figure, it can be seen that the results are consistent 
basically. 


4.3 Main Parameters of BICMOS OC 


Table | shows the other parameters of BiCMOS OC which are measured by experi- 
ment, it proves that the new proposed converter has extremely low power consump- 
tion( Pp~69 mW) , -3 dB bandwidth is about 58.4 kHz, the gain linearity NF is 
8.3x10°, the other parameters (e.g. input impedance, CMRR, leakage current) reach a 
good level too. 


Table 1. Measured parameters of the BICMOS A/D Photoelectric Converter 


“3 dB (KHz) Pp (Mw) 


5 Summary 


This proposed BiCMOS Schmitt OC module is very suitable for low-frequency dis- 
placement OCS. Its advantages include: ()With the advantages of BiCMOS devic- 
es—low power consumption and high integration, it can be designed into OC chip with 
low power consumption (69 mW) and small device size (0.15 um) to meet the trend of 
nano semiconductor devices; @)Under the situation of approximate same parameters, 
the two selected optical devices can reduce the impact of dark current, and also can 
reduce conversion error caused by temperature change; (8)As the lower level of +VDD, 
bandwidth is about 58.4 kHz, it is 

suitable for low frequency and low 10 


pressure. @ The change of circuit 3 OF 

parameters, especially the fluctuation i ~ 10}. 

of +VDD and light intensity change ¥ ~ 204. 

in workshop can affect the stability of 3 ~30!.. 

signal detection, so negative feedback & ~40}.. 

is introduced in OA A. Additionally, 2 ~50}- 

unipolar devices, isolation effect of = ~60 “4 

the source follower and strong an- a ay 0 10 10 as 
ti-interference ability of the Schmitt f/Hz 


Trigger are adopted, so the problem 
about the stability of OC can be Fig. 7. Amplitude-frequency characteristics 
solved. of OA A 
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Abstract. A kind of differential optical converter was designed, the whole 
conversion circuit used three op amp amplifier circuits, the voltage series neg- 
ative feedback was introduced in the first stage op amp Al, A2 which were 
used CMOS op amp. The voltage parallel negative feedback was introduced in 
the second stage op amp A3 which was taken into parameter symmetry diffe- 
rential input subtraction device. A3 was designed into BiCMOS op amp, so the 
whole circuit made up three op amp and triple loop control system. The territo- 
ry size on the operational amplifier was designed and the chip area was 0.44 
mmx0Q.31 mm, and the simulation and hardware circuit experiments were car- 
ried out. These experiments results show that the converter can use 3.0 V~5.0 V 
power supply, the gain linearity is up to 5.7x10-5, common-mode rejection ra- 
tio is 0.9x109. With these characteristics the designed converter is very suitable 
for high performance optical fiber communication systems. 


Keywords: BiCMOS and CMOS devices; differential optical converters; 
closed-loop systems; ASIC design. 


1 Introduction 


With the rapid development of computer networks, the universal application of 
multimedia communication and _ large-scale construction of information 
highway, high-performance fiber-optic communications and other optical transform 
system are needed urgently. The computers have the advantages of high speed 
operation, high reliability, data processing, storage and control advantages, so it 
has been widely used in optical measurement and control systems, especially in 
industrial process control and production pipeline. Simulation of electron converter as 
an application specific integrated circuit (ASIC), when the measured non-electrical 
signals (such as optical signal, medium thickness, displacement, etc.) load in the 
optical signal, it often uses the way of luminous flux to transmit to optoelectronic 
devices, optoelectronic device receives optical signals, while enabling photoelectric 
conversion circuit voltage output analog signals. The output voltage uo is the function 
of the measured signals Q, while uo=f( Q). Obviously uo is not only related with the 
Q and the carrier flux, but also related with the electron converter's parameters 
change. Therefore, design of a high-performance photovoltaic system and use of 
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convert information transformation ASIC module are both particularly important. 
According to inspection of domestic and foreign literature, design of high- 
speed, high accuracy, photoelectric conversion of low-power ASIC reports[1~2] has 
not been proposed. Therefore, it is necessary to use BiCMOS technology to design a 
differential electron converter and the operational amplifier chip to meet the need for 
optical engineering. 


2 Design of a Differential BiCMOS Electron Converter 


2.1 Dual Beam and Dual Optoelectronic Devices’ Design 


Figure | shows the dual beam and dual differential optical detection system device. 
Light source send the beam, and the beam through the mirror into road reference 
system and measuring optical system. The up output flux is received by optoelectron- 
ic device VD, and the down flux is received by optoelectronic device VD). The op- 
toelectronic devices VD, and VD,» are covered and packaged together in the tank. The 
VD, and VD,'s characteristic parameters should be consistent as possible[3]. 


Reference System 


Dimming Optoelectronic 
parts device VD, 


Source . 
Measured Optoelectronic 


lobject parts device VD, 


Measurement of optical system 


Fig. 1. Dual beam and Dual optoelectronic devices 


2.2 Differential Photoelectric Conversion Circuit 


Optoelectronic devices followed by a differential conversion circuit can be seen in 
Figure 2. VD, and VD, are accessed to the op amp circuit in the form of figure 2. 
Output voltage of optoelectronic devices VD; is uyp;, output voltage of the 
measurement system VD, is uyp2, Op amp A, and A, are connected to a series of 
negative voltage feedback circuit, to increase the input impedance differential 
circuit, and to drift the temperature offset and to improve the circuit's common mode 
rejection ratio. Amplifier A; is connected to form a symmetrical subtraction device 
parameters, Rg is in the introduction of a negative voltage feedback, so that the 
introduction of the three op amp are in the depth of negative feedback, which greatly 
improves the performance of many electron converter. Three op amp circuit voltage 
gain is: 


Uo! ( Uyp2—Uvp1) =( Ro/Rs) x[1+( 2R4/R3) ] (1) 
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Vop 


Fig. 2. Differential photoelectric conversion circuit 
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ai 


(b) A3: BiCMOS op amp 


Fig. 3. Differential conversion circuit op amp BiCMOS F/V converter 
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when R;=R,=R;5=R,, the output voltage can be written in more concise form: 
uo =3 Uyp2 —uyp1) (2) 


In fig.2, A, and A, are designed to CMOS operational amplifier, A; is designed to 
BiCMOS operational amplifier, and the entire optical converter is becoming a diffe- 
rential BiCMOS circuit[4~5]. Therefore, all the single and bipolar devices in the 
circuit can use the preferred device parameters in table | of the work[6]. 


2.3 The Design of Operational Amplifiers A1, A2, A3 


The internal circuit of CMOS A, and A, are showed in Fig. 3 (a). It is a two-stage large 
circuit: The first level is common-source amplifier, and the second stage is common- 
gate amplifier[7~9]. The internal circuit of CMOS A; is shown in Fig.3 (b). 


3 Design of op amp Chip 


The photo electric conversion circuits in fig.2 can be implemented by TSMC compa- 
ny’s standard using 0.25 um BiCMOS technology. The specific method is that each of 
A,(A,)and A; can separately be made into chip, or three chip circuits are made into a 
single ASIC. Because of the effect of optical device the level chip A,(Az)is larger 
than A3. Therefore, in the Cadence environment 
firstly complete A,;(A2)map design, and obtain para- 
sitic RLC parameter and the later simulation experi- 
ment. Simulation results show that because of the 
influence of circuit parasitic parameter, the circuit- 
bandwidth is declined, therefore by reelecting com- 
pensation capacitance value and optimizing plat 
fitter can attain the design demands. Fig.4 shows plat 
area of chip A;(A)), and die size is 0.44 mmx0.31 
mm. The same can design chip A; map, and die 
Fig. 4. Layout design of op amp _chip is the same as A,(A). 


4 The Experimental Results for Discussion and Analysis 


4.1 The Circuit of the Transmission Properties on the Voltage and Frequency of 
the Curve 


Firstly adopting this method to test the voltage of transport properties, and three 
discharge of the external resistors: R3=R4=Rs=Ro=100 kQ, and input resistance 
R,=R2=10 kQ; and amp impedance let parameters: Rp=Rp=100 kQ, C\=C,=10 pF; 
When testing input signal frequency sine is | kHz. Using MATLAB software to draw 
and simulate for obtaining voltage-transfer characteristic curve, and together with the 
hardware circuit test results of voltage transmission characteristics curve drawing 
in figure 5. From the curve, and the simulation result obviously is consistent 
with type(2) expressing the operation result in characteristics of drawing, and illite- 
rately the amplifier closed-loop gain of voltage is truly 3; But actual transmission 
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Fig. 6. The frequency response of photoelectric conversion circuit 


characteristic curve and the ideal simulation curve exists error, and this is mainly in- 
duced by external disturbance signal interference and parasitic capacitance charging. 

Then the frequency responding characteristic is tested. Circuit and device parame- 
ters are the same as above, and 14 frequency points in 50~100 kHz frequency range 
are measured, while drawing simulation results and the amplitude frequency charac- 
teristics curve by software MATLAB, as shown in figure 6. By the graph 
shows, practical testing and simulation results are consistent. When the voltage 
circuit amplitude dropped to 0.707 times of low frequency, and phase delay is about 
45°, the moment sinusoidal input frequency is more than 60 kHz, and illiterately 
three simulated transmittal circuits of bandwidth are more than 60 kHz. 


4.2 The Main Parameters of Photoelectric Conversion Circuit 


The main parameters of the experimental measured BiCMOS differential photoelec- 
tric conversion circuit are listed on Table 1, which prove the good performance of the 
converter design, its speed is faster (fpp is 15.4 ns), -3 DB bandwidth is about 77.3 
kHz, gain linearity is 5.7 10°, and other performance indicators (such as input 


impedance, common mode rejection ratio, leakage current) was also improved. 


Table 1. The main design of parameters differential photoelectric conversion circuit 


Amplifier 
: Input Leak: Suppl 
—3dB gain . ae sau wee Delay 
: : impedance current voltage 
/Hz linearity tpp /ns 


Z,/MQ Ip uA +Vpp IV 


NF 
773. | 5.7x10° 0.9x10° 3.0~5.0 | 15.4 
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5 Summary 


Differential BiCMOS optical converter is designed in this paper, and op-amp chip 
size is determined, the advantage of converter gain, bandwidth, power and speed is 
illustrated by experiments. The converter can be used for optical communications and 
other optical signal detection system, because it has the following advantages: 
(1) Using the characteristics of BiCMOS technology “strength of gain - bandwidth 
product and power - delay product of single and bipolar device” to achieve high-speed 
low-power electron converter indicators; (2) By selected two photoelectric device in 
the close of characteristic parameters situations, the dark current influence can be 
reduced, and conversion errors can be reduced when the temperature changes. 
(3) Optical path and reference optical path use the same light source, light source 
drift and volatility will not bring great influence on conversion results; (4) Changes of 
circuit parameters, especially +Vpp fluctuation will affect the stability of the meas- 
ured signal. In addition, Luminous flux is related with light, optical system and the 
performance of mechanical structures. Therefore, the operational amplifier introduc- 
es the deep negative feedback to solve the problem of design simulation of 
electron converter. 
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Abstract. In order to rationally optimize the clear lacquer practice in modern 
furniture production application, manchurian ash rotary cut veneers was used as 
the substrate. The experiment selected the biorthogonal wavelet to decompose 
wood picture using multi-level technique, and extracted low-frequency subgraph 
(LL) and high-frequency subgraph (HL, LH) on wood texture after clear lacquer 
and conduct multi-scale spectral analysis. The quantitative comparison of the 
texture parameters were studied and acquired transformation rule of the wood 
surface texture features via after coating process through nitrocellulose varnish 
(glossy, matt), alkyd varnish (glossy, matt) and polyurethane varnish (glossy, 
matt). The results showed that : (1) Feature vector obtained by wavelet, examine 
the value of sub-image and standard deviation of the energy reflected in the tex- 
ture characteristics, can effectively reflect the wood grain under several clear 
lacquer variation rule, characteristic and directional; (2) Texture enhancement 
significantly different, the overall effect performance is alkyd varnish (glossy) > 
nitrocellulose varnish (glossy) > polyurethane varnish (glossy) > Alkyd varnish 
(matt) > polyurethane varnish (matt) > nitrocellulose varnish (matt). Studies sug- 
gest that, clear lacquer is conducive to enhancing the substrate texture features, 
the effect is significant and positive for changing the visual effect. 


Keywords: Wavelet, Texture analysis, Digital image processing, Quantitation, 
Multiscale representation, Feature parameters, Clear lacquer. 
1 Introduction 


Clear lacquer is effective to preserve the natural wood grain and color, is to extend the 
service life of wood and enhance the natural wood pattern and color, and increases the 
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value of timber products effective technical measures, is the senior production of 
wood products, furniture and commonly used treatment process. Because of its com- 
position ratio and the finishing processes of the change will directly affect the timber, 
after finishing the visual effects, thus affecting the value of the goods and the con- 
struction of the level of the visual effect of environmental studies, so its control tech- 
nology has been developed for the paint manufacturers and furniture manufacturing 
industries of the importance [1]. 

There have been some scholars in China before and after finishing physical 
changes in wood vision a quantitative measurement (Jian Li, 1998, Yi-xing Liu, 1995, 
Xin-fang Duan,1998, Hai-peng Yu, 2005, however, after wavelet feature vectors 
obtained to study the energy value and sub-image as reflected in the standard devia- 
tion of the texture features, to comparative analysis of several transparent coating on 
the surface of the same visual effect veneer, not been reported so far[2-3]. 

The method of drawing on the basis of previous studies (Hai-peng Yu, 2005), lac- 
quer Nitrocellulose as a common market in general subjects, nitrocellulose varnish 
(glossy, matt), alkyd varnish (glossy, matt), polyurethane varnish (glossy, matt), which 
are commonly used for wood products painting in China for the experimental subjects, 
reference furniture factory standard finish on the wood finishing process, wavelet me- 
thod (filter length of 8, the decomposition scale is 2) multi-scale texture on the wood 
spectrum analysis, comparative study and summarizes the impact of their rules, is 
related to methods used in the furniture industry and decoration industry, improve the 
scientific nature of the paint color to guide the improvement of finishing processes, 
better learn to play the quality of wood play a role in the visual environment. 


2 Materials and Methods 


2.1 Materials Preparation 


Substrate: Manchurian Ash rotary cut veneers, size 100mm x 100mm x 10mm, air- 
dry at room temperature, moisture content 8.26%; 

Paint types: nitrocellulose varnish (glossy, matt), alkyd varnish (glossy, matt), po- 
lyurethane varnish (glossy, matt), Hua Run Paint Co., Ltd., Shunde, Guangdong; 

Process of clear lacquer: primer paint 2 times, coat finish 2 times. Shoot digital im- 
age, the image of the sample surface and the parts of the image is consistent with the 
material, if screenshots are necessary, please make sure that you are happy with the 
print quality before you send the files. 


2.2 Experimental Procedures 


Application scanner of EPSON1670, image sampling accuracy is set to 512 x 512 
pixels, gray levels of 256, saved as a BMP image format, procedures submitted to the 
texture. Presented a good image will be saved texture procedures, eigenvectors pro- 
gramming using Matlab wavelet sub-image from the vector concentrated extract tex- 
ture feature vectors needed for the analysis. 

Texture characteristic frequency decomposition: An image (two-dimensional sig- 
nal) obtained by the decomposition of 4 sub-plans: (1) subgraph (LL); (2) subgraph 
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(LH); (3) subgraph (HL); (4) subgraph (HH), in Fig. 1. This is the nature of the sub- 
graph: a’ in the horizontal and vertical directions have low-pass characteristics, sub- 
graph (LL) concentrated original image of mainly low frequency components, from 
the visual point of view, approximate profile of the image information; d'™ in the 
horizontal direction for the low-pass characteristics, high-frequency characteristics in 
the Vertical direction, from the visual point of view, subgraph (LH) well preserved 
the original image level of the boundary line of the boundary points; d"" in the vertic- 
al direction for the low-pass characteristics, high-frequency characteristics in the 
horizontal direction, from the visual point of view, subgraph (HL) well preserved the 
original image of the vertical boundary line boundary points; d"" in the horizontal and 
vertical directions have high-frequency characteristics, from the visual point of view, 
subgraph (HH) retain only a few scattered boundary points of the original image[4-5]. 

These subgraph fully reflect the image at different scales, different frequency tex- 
ture features in different directions, for the image analysis and classification provide a 
good foundation. 


Vertical middle 
frequency subband 


Horizontal middle 
frequency subband 


Fig. 1. Two-dimensional duple discrete wavelet decomposition 


Texture feature extraction. Matlab programming using wavelet decomposition sub- 
image from the vector extraction parameters required for texture analysis. 
Wavelet energy distribution: For an N x N size image, and its energy distribution is 


defined as: 
Ef = ) ) — 7 (1) 
N- 


m=1 n=1 


The multi-scale wavelet decomposition, the original image detail sub-images LL, LH 
and HL k-order wavelet energy distribution is defined as: 


ni2* k) 2 
[LH (m,n) | 
HH” = ») = Q) 


m=(N/2")+1 n=1 


k-1 
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(4) 


The proportion of energy and the directional characteristics of a special detail sub- 
image to all details sub-image of the same scale, is defined as ratio of wavelet energy 
distribution. 


(k) 
ELH 
EPLH® = 
nip” iggE” + me” 


EHW” 


(k) a 
EPHL® = (6) 
ELE’ 4Hpe” + BoB” 
(k) 


ELE’ .e8L° + eee” 


Thus, EPLH™ reflected the original image in the horizontal direction energy distribu- 
tion and specific gravity; EPHL™ reflected the original image in the vertical direction 
energy distribution and specific gravity; EPHH” reflect the original image in the 
diagonal direction energy distribution and specific gravity (Hai-peng Yu, 2005). 


3 Results and Discussion 


3.1 Visual Comparison of Samples Before and After Clear Lacquer 


For the clear lacquer of wood samples before and after visual observation of the visu- 
al image can be seen: Compared with the substrate, after finishing in the visual sense 
of color more vivid, light saturation, texture becomes more obvious and clear. Clear 
lacquer help enhance transparency texture coating, the effect of changing the visual 
effect is significant and positive [6]. 


3.2 Texture Characteristic Analysis Before and After Clear Lacquer Finishing 
Changes 


Because the image by the wavelet first and second decomposition, the texture infor- 
mation with the gradual decomposition of the increase in the number of amplification, 
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decomposition of the second layer, fully reflects the texture information, standard 
deviation reaches its maximum, the most obvious difference in texture reflect, there- 
fore, choose the decomposition scale of 2. 


Sub-images LL analysis of texture parameters. Tab. 1, Fig. 2 were calculated and 
analyzed by the orthogonal design assistant 2 obtained wavelet decomposition on the 
sub-images LL visual analysis of the energy value chart table, variance analysis 
diagram. 


Table 1. Subgraph visual analysis of the energy value table before and after clearing lacquer in 
WaVenLL 2 wavelet decomposition scale. 


Decomposition scale at 2 


Sub-image types Treatment Before After Difference 
WavEnLL Alkyd varnish (glossy) 28604.39 13991.9  -14612.49 
Alkyd varnish (matt) 25885.07 20020.96 -5864.11 


Polyurethane varnish (glossy) 35604.61 27770.91  -7833.7 
Polyurethane varnish (matt) 30857.27 31220.37 363.1 
Nitrocellulose varnish (glossy) 42657.76 30760.81 -11896.95 
Nitrocellulose varnish (matt) 41834.29 43202.73 1368.44 
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Fig. 2. Subgraph visual analysis of the energy variance chart before and after clearing lacquer 
in WaVenLL at 2 wavelet decomposition scale 


Tab. | and Fig. 2 shows, subgraph (LL) in the low frequency components, after po- 
lyurethane varnish (matt) and nitrocellulose varnish (matt), energy values were in- 
creased, difference was 363.1,1368.44,the others approach is to make the substrate 
energy value decreased, reduce the rate of alkyd varnish (glossy) > nitrocellulose 
varnish (glossy) > polyurethane varnish (glossy) > alkyd varnish (matt); Clear lacquer 
(glossy) and alkyd varnish (matt), texture enhancement, the degree of enhancement is 
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directly proportional to reduce the magnitude of the energy value; Nitrocellulose 
varnish (matt) and polyurethane varnish (matt), texture enhancement. 


Sub-images LH analysis of texture parameters. Tab. 2, Fig. 3 were calculated and 
analyzed by the orthogonal design assistant 2 obtained wavelet decomposition on the 
sub-images LH visual analysis of the energy value chart table, variance analysis 
diagram. 


Table 2. Subgraph visual analysis of the energy value table before and after clearing lacquer in 
WaVenLH 2 wavelet decomposition scale 


Decomposition scale at 2 


Sub-image types Treatment Before After Difference 
WavEnLH Alkyd varnish (glossy) 5.119006 7.598602 2.479596 
Alkyd varnish (matt) 7.00006 = 3.978562 = -3.021498 


Polyurethane varnish (glossy) 4.008253 6.366469 2.358216 
Polyurethane varnish (matt) 7.83156 2.835516 -4.996044 
Nitrocellulose varnish (glossy) 3.978271 7.387462 3.409191 
Nitrocellulose varnish (matt) 5.73135 3.253840 -2.47751 
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Fig. 3. Subgraph visual analysis of the energy variance chart before and after clearing lacquer 
in WaVenLH 2 wavelet decomposition scale 


Tab. 2 and Fig. 3 shows, subgraph (LH) in the high frequency components, after 
alkyd varnish (glossy) and polyurethane varnish (glossy), energy values were in- 
creased, Reduce the rate of Nitrocellulose varnish (glossy) > alkyd varnish (glossy) > 
polyurethane varnish (glossy), the others approach is to make the substrate energy 
value decreased. Because with the texture from coarseness to fineness, from weakness 
to strength, subgraph (LH) is strengthened gradually, so we can see, clear lacquer 
(Glossy) treatment, substrate in the horizontal direction (horizontal grain direction) 
texture enhancement, enhancing the extent and magnitude of increase is proportional 
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to the energy value; Clear lacquer (matt) treatment, substrate texture weakening in the 
horizontal direction, because the energy value is reduced by a big margin, so the 
greater the degree of weakening. 


Sub-images HL analysis of texture parameters. Tab. 3, Fig. 4 were calculated and 
analyzed by the orthogonal design assistant 2 obtained wavelet decomposition on the 
sub-images HL visual analysis of the energy value chart table, variance analysis 
diagram. 


Table 3. Subgraph visual analysis of the energy value table before and after clearing lacquer in 
WaVenHL 2 wavelet decomposition scale 


Decomposition scale at 2 


Sub-image types Treatment Before After Difference 
WavEnHL Alkyd varnish (glossy) 56.02797 26.82398  -29.20399 
Alkyd varnish (matt) 46.40508  34.02359  -12.38149 


Polyurethane varnish (glossy) 28.59873  35.98652 7.38779 
Polyurethane varnish (matt) 37.44300 19.2408 -18.2022 
Nitrocellulose varnish (glossy) 25.61983 29.34062 3.72079 
Nitrocellulose varnish (matt) 27.48571  20.31202  -7.17369 
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Fig. 4. Subgraph visual analysis of the energy variance chart before and after clearing lacquer 
in WaVenHL 2 wavelet decomposition scale 


Tab. 3 and Fig. 4 shows, subgraph (HL) in the high frequency components, after 
polyurethane varnish (glossy) and Nitrocellulose varnish (glossy), energy values were 
increased, difference was 7.38779, 3.72079, the others approach is to make the sub- 
strate energy value decreased. Because with the texture from coarseness to fineness, 
from weakness to strength, subgraph (HL) is strengthened gradually, so we can see, 
nitrocellulose varnish (glossy), polyurethane varnish (glossy) treatment, substrate in 
the vertical direction (along the texture direction) texture enhancement, enhancing the 
extent and magnitude of increase is proportional to the energy value; Alkyd varnish, 
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polyurethane varnish (matt) and nitrocellulose varnish (matt) treatment, substrate 
texture weakening in the vertical direction, because the energy value is reduced by a 
big margin, so the greater the degree of weakening. 


4 Conclusions 


Wavelet analysis, multi-scale texture of the wood spectrum analysis, transparent coat- 
ing can effectively direct quantitative visual effects on the wood surface influence. 

The results showed: Clear lacquer (glossy) and alkyd varnish (matt), texture en- 
hancement, the degree of enhancement is directly proportional to reduce the magni- 
tude of the energy value; nitrocellulose varnish (matt) and polyurethane varnish 
(matt), texture enhancement. Clear lacquer (glossy) treatment, substrate in the hori- 
zontal direction (horizontal grain direction) texture enhancement, enhancing the ex- 
tent and magnitude of increase is proportional to the energy value, clear lacquer 
(matt) treatment, substrate texture weakening in the horizontal direction; Nitrocellu- 
lose varnish (glossy), polyurethane varnish (glossy) treatment, substrate in the vertical 
direction (along the texture direction) texture enhancement, enhancing the extent and 
magnitude of increase is proportional to the energy value, alkyd varnish, polyurethane 
varnish (matt) and nitrocellulose varnish (matt) treatment, substrate texture weaken- 
ing in the vertical direction. 
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Abstract. Gansu Science & Technology Documentation Sharing Platform con- 
sists of five systems: full-text retrieval and web-publishing system, heterogene- 
ous digit resources unitive search system, original text delivery system, user 
management and accounting system, statistical analysis system. The application 
and technique architectures of the platform were elaborated in this paper, and the 
major key technologies on the platform were also expounded, which include un- 
itive search system, web2.0, web services and data security. The platform having 
been running shows that it integrated 173 resource databases, implemented "one 
stop" services, improved document resource integration degree., improved 
service quality, level of management and market competitiveness capacity of do- 
cumentary information organizations, reduced the repetitive investment of doc- 
ument resource and the development of duplicate of databases resource which 
have the same content. 


Keywords: document sharing, architecture, unitive search, original text deli- 
very, web service. 


1 Introduction 


July 2004, the general office of state council forwarded "2004-2010 National Science 
and Technology Infrastructure Construction Program", and proposed the target of build- 
ing science and technology infrastructure which is resource-rich, layout reasonable, 
technologically advanced, fully functional, operational efficiency. Currently, from na- 
tional to provincial and municipal, they builded science & technology documentation 
sharing platform at all levels so as to provide better service for the scientific innovation. 
Since 2005, Gansu province relied on Institute of Science & Technology Information of 
Gansu as the main undertaker for building Gansu Science & Technology Documenta- 
tion Sharing Platform (http://www.gsstd.cn), to the present, nearly 2,000 individuals and 
104 group users registered, in the construction process, building models and key tech- 
nology of the platform are very worthy of study and reflection, which sum up, can serve 
the future development of the role of inspiration and reference. 
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References [1] proposed four modes of current science & technology documenta- 
tion sharing platform and analized in detail: resource-oriented mode, integrating ser- 
vice mode, technological application mode and comprehensive mode. Gansu science 
& technology documentation sharing platform is comprehensive, joint catalog, refer- 
ence, the original text delivery and other work shows the construction model of 
integrated service, on the other hand, resource integration reflects technological appli- 
cation. At present, the platform integrated literature resources in seven major collec- 
tion of literature units, a total of 173 resource databases. Document types include 
journal papers, standards, patents, dissertations, union catalog, report of the meeting, 
agency products, local characteristic resources, network of development research cen- 
ter of the state council family library literature, through the unitive portal platform 
website to offer free secondary document search services for community, according to 
the user's need to provide the appropriate primary document paid services. 


2 Platform’s Architecture 


Software architecture directly restricts success or failure of software development[2]. 
The design and implementation of software development of science & technology 
documentation sharing platform which was acceptable and robust has important signi- 
ficance to its construction. Based on previous researchers’ R&D harvest, and accord- 
ing to requirement of construction and maintenance, its software architecture was 
constructed from different aspects of application and technology. 


2.1 Application Architecture 


This platform is composed of five systems, as shown in fig. 1. 


system Tr strator 


eres | text retrieval user managernent and || statistical rua 
web- eres | ces accounting systern rua 
Users f")) sery- 
ice 


[_ heterogeneous digit resources unitive search system | digit resources unitive [_ heterogeneous digit resources unitive search system | system 


resource database resource database 


Fig. 1. Application Architecture 


1) www service 

WWW service including resource directory and notice information. It supplied entry 
of system such as online registration, personal information query, accounting query 
and document delivery query. The entry of www servie supplied entry of registered 
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users and nonregistered users. Nonregistered users cannot download document and, 
but can search title and abstract. 


2) full-text retrieval and web-publishing system 

As construction unit of documentation, information which is massive and repeatedly 
used was collected. However, in these data that cannot be transformed into field such 
as text, image, audio, video, compound document, it also accounts for certain propor- 
tion. They cannot be efficiently handled by traditional RDBMS. And their real value 
was substantially reduced. Meanwhile, as structured information which could be effi- 
ciently handled by traditional RDBMS, It has defects such as processing speed slow, 
none-uniformity, insufficeient that indexed them. It could not meet information swift 
growth needs. 

The full text retrieval takes the text data as the main processing object, provides the 
advanced inquiry method according to the material content but not the external cha- 
racteristic realizes. This platform uses full text retrieval system which was widely 
applied in the domestic books intelligence system——TRIP(http://trip.istic.ac.cn/ 
html/tripchn/docs/trip.htm). 


3) heterogeneous digit resources unitive search system 

The unification retrieval system refers to the user submit retrieval request through 
sole and user-friendly interface which could access many web databases and search 
engine at a time, gain more accurate and orderly retrieval result. It shows high preci- 
sion and retrieval efficiency in higher recall. 

User may retrieve various resources database on one retrieval and two retrievals. 
The field include title, author, full text, keyword, category number and so on. The 
inquiry condition was saved which could directly use in the later inquiry. The retriev- 
al result display by paging. User can collect the retrieval result and browse digest in- 
formation. If it has online full text in retrieval result, registration user may directly 
download it which could be automatically accounted by system. If it has offline full 
text, user could gain full text by original text delivery. 


4) original text delivery system 

It process request of original text delivery which user submit from search result. Before 
processing request, it authenticate users’ identity and the balance of account. If passed, 
request of original text delivery would be transmited to corresponding collection unit. 
It adopts advanced service pattern which is end user-oriented. Readers submit request 
of original text delivery on line by myself and gain full text in email when register in 
this system. The entire process does not need any third party involvement. The way of 
gaining literature is convenient, quick, conforms to reader's information acquisition 
custom under especially informationization swift development environment. 


5) user management and accounting system 

User management include several service such as user registeration, user manage- 
ment, send messages, send email, batch processing on user period of validity, batch 
processing on user prestore fee and user imformation online query. Among them, 
sends messages or email to the registered user is advantageous to publicize, then two 
batch processing functions greatly raised the literature service efficiency. The user 
management also provides the user authenticate [3] and the jurisdiction inspection, it 
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assists to complete the user authentication of the literature WWW service, the unifica- 
tion retrieval and the original text delivery system. 

Accounting system completes account of full text downloading and the original 
text request, it may recharge and refund money for the user. 


6) statistical analysis system 
According to the received document retrieval services, full download and the original 
ordering information, we count the use of various types of documents resources, users 
geographical and age distribution to provide reliable data for the decision analysis. 
Statistical analysis of original text delivery system provides "Costs Statistics ", 
"User Statistics", "Sended Request Statistics", "received request statistics" and other 
modules, notably the three statistical feature of the design can be described as origi- 
nality. "User Statistics" provides the results from the user education, job title, type, 
total number of accounts, using or not using the number of multiple dimensions of 
analysis, screening: the "send or receive requests statistics" module, not only count 
each document collection unit of sended or received requests number, but also pro- 
vide details of the treatment results: satisfied, not satisfied and the specific reasons 
of not satisfied, the service hours, tolls. This is easy to understand document service 
units of the various documents for the processing of collections, the museum services 
of external circumstances, from mining problem, then the document collection for 
each unit of the library resources development, training and readers to communicate 
with the outside hall and other work provide the basis for the launching. 


2.2 Technical Architecture 


This platform based on php and Web Service to view, business logic and data layers of 
a reasonable division of the overall framework of the system more optimized. Technic- 
al architecture was divided into the portal layer (service channel level), application 
support layer, application layer, information resources layer, as shown in Fig. 2. 


browser 


web server 


client 


web server 


business application 
layer server 

resource resource 
layer database cluster 


Fig. 2. Technical Architecture 
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System architecture with four layers: client (request information), business 
(process the request) and data (to operate) which are physically isolated. Client layer 
consists of the browser and rich client, the browser interface for the display system, 
the rich client to handle ajax requests and XML data. Service layer consists of Web 
servers and application servers of literature gateway, Web server handle display logic, 
application servers of literature gateway deal with document retrieval business logic, 
business logic in the middle layer, do not care about what type of client request data, 
but also with the back-end system to maintain relative independence, is conducive to 
system expansion. Four- layer structure has better portability, you can work across 
different platforms, allowing users to request load balancing between multiple serv- 
ers. Security is more easy to implement, because the business logic has been isolated 
with the customer. 

Four- layer communication between the following: The browser through Ajax 
asynchronous call to send user requests to the Web server, Web server and the litera- 
ture gateway were used between the SOAP protocol, the gateway application server 
through HTTP, Z39.50, ODBC or JDBC with the literature resources such as 
databases. 


3 Key Techniques 


3.1 Unitive Search 


Unitive search [4] also known as cross-database searching, unitive search system 
must work around two of the most significant features such as the heterogeneous and 
distributed computing, as long as the shield of heterogeneous database resources, rea- 
listic solutions will be able to propose with rational use of distributed processing 
technology to achieve a unitive document resources inquiries. 

From the technical point of view, there are two kinds of unitive search model: 
joint search and integration search (http://www.chnlib.com/zylwj/shuzitsg/200605/ 
221.html).Joint search process commonly used simulation Web access [5], the search 
criteria which was entered in unified search interface will be automatically saved and 
sended to multiple digital resources system, the digital resources system start their 
search system to search and show search results in the same interface. 

In method of implementation of web-based simulation, the core technology is web 
information extraction. This paper proposes a new method which can extract the use- 
ful information from the different document sites automatically——web information 
extraction based on sub-tree breadth [6].The method is that view title number of per 
page of scientific and technical literature web site and store the number in the data- 
base, and then use the HTML Tidy(http://tidy.sourceforge.net) to clean up these pages 
into XML documents, and then generate DOM tree, computing breadth of someone 
sub-tree in DOM tree, by judging the breadth of a sub-tree is equal title number of per 
page of pre-stored in the database to establish the key information block[7-8], and 
then extract information from key information block. Experiments show that the me- 
thod can guarantee a high accuracy in terms of recall and precision [9]. 
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3.2 Web2.0 


To improve user experience, the system uses Ajax, Tag and other Web2.0 technology 
to implement personalized service. Personalized service refers to users can obtain 
information or services by their own purposes and needs. For example: Users can 
collect their own commonly used search document type and commonly used literature 
database by tag for later use. In user's document classification, a free label was used, 
allowing users to freely define. In resource database display, ajax was used, the user 
clicks the label to select resources database list of data output, in the whole operation, 
user does not need to refresh the entire page. In addition, my database, my favorites, 
my search history is the performance of personalized service. 

Although the main advantage of PHP is superior processing character speed and re- 
liability, through combination of Apache 2.0 makes the unitive search system has 
good stability and performance [10], but it does not support multi-threading, and un- 
itive search system needs to search multiple databases at the same time, if an ordinary 
single-threaded program, processing speed will slow that people can not tolerate. 
Multi-task programming techniques was improved by ajax to enhance the program 
efficiency and avoid a "suspended animation"state of program interface. 


3.3. Web Service 


The distributed store of digital resources is one of its main features, the traditional 
unitive search system was designed for a specific portal, it was lack of sharing re- 
sources and accounting between sharing units. System needs to establish Web service 
program for title search, abstracts obtain and full-text download by asynchronous call 
procedures, results transfer by XML file between users and documents node group, it 
achieved grid resource sharing, resource exchange visits and billing. For users, docu- 
ment search services mainly refers to it receives user requests and then starts the 
searching machine, and a XML file of search results is generated by services. The 
main services include resource list service, browsing titles service, browsing abstracts 
service and full-text download service. 


3.4 Data Security 


This platform provide services by network, in this process, how to protect data securi- 
ty, mainly depend on following several technology: (1)strengthen right setting, legi- 
timate users can be accessed by password; or through the IP address settings, users in 
IP segment can be accessed. (2)restrict data traffic of accessing databases to prevent 
malicious downloads and databases collapse. (3)using encryption and digital signature 
technology during network transmission, to prevent theft and destruction. 


4 Conclusion 


Key technology and the model of this platform has been successfully applied in addi- 
tion in Gansu province, but also promote to China petroleum, Qinghai prov- 
ince(http://www.textqh.com), Ningxia province(http://www.nxkjwx.com.cn) and 
other provincial-level scientific and technical document sharing platform. The 
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practice shows that the platform improve their integration degree. This research offers 
a feasible approach for the realization of "one stop" services. The application of this 
platform greatly promoted the process of information sharing of documentary infor- 
mation organizations, improved service quality, level of management and market 
competitiveness capacity of documentary information organizations, reduced the re- 
petitive investment of document resource and the development of duplicate of data- 
bases resource which have the same content. 

Along with the development of science & technology documentation sharing plat- 
form, research and development of heterogeneous digital resources unitive search will 
enter a deeper level, function will get further rich, but unitive search based on web 
service was created to solve non-standard data interface of vendors, and access and 
retrieval interface which was based on standard and norms of resource database is the 
solution to the current "information island" phenomenon of the effective ways and 
means.Our country should early strengthen standards and norms and compulsory 
promotion to complete our country digital libraries as soon as possible. 
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Abstract. In this paper, an efficient method to relax timing requirements of 
CRFF sigma-delta modulators has been proposed. A system optimization to cir- 
cuit level design was finished. Class-C inverter was used to realize half delay 
integrators of the proposed structure. A 4th-order 1-bit CRFF topology was im- 
plemented in smic 0.134m CMOS technology. With 31.25MHz sampling fre- 
quency and 64x oversampling ratio, 13.2-bit resolution has been reached. The 
whole circuit consumes 472.63-1W power from a single 0.6V supply voltage. 
Thus, a low-voltage low-power medium-bandwidth high-accuracy sigma-delta 
modulator has been obtained. 


Keywords: sigma-delta modulator, switch-capacitor (SC) integrator, class-C 
inverter, low-distortion topology, low-power. 


1 Introduction 


Sigma-delta data conversion technique is commonly used in many fields such as 
wired and wireless communication systems, consumer electronics, radar systems and 
medical applications because of its high-accuracy and wide-band. In order to satisfy 
the ever-increasing demands for portable devices, the development of sigma-delta 
modulators (SDM) whose power in macro watt rang has become a hot research topic 
recently [1][2][3]. 

Although complex structures, such as cascade and multi-bit modulators, could be 
used to realize extra high resolution and speed, high power consumption is inevasible. 
Of course, continuous-time (CT) implements seem to show lower power consumption 
than switch-capacitor (SC) ones, however, high sensitivity to clock jitter and large 
coefficient warp restrict their design. Thus, simple SC architecture, i.e. single-loop 
single-bit SC modulator, is still the main method to realize low-power high- 
performance sigma-delta modulators. In addition, the low-voltage low-power perfor- 
mance of feed-forward topologies could be superior to the feedback ones, since they 
have reduced internal signal swings by cancelling the input signal at the input of loop 
filters. 

Therefore, single-loop single-bit low-distortion cascade of integrators with distri- 
buted feed-forward form (CIFF) and cascade of resonators with distributed feed- 
forward form (CRFF) [4] could be effectively used for low-power applications. For 
high-speed modulators realized by SC integrators, there are advantages to have a 
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delay associated with each integrator, since it reduces the speed requirements of the 
operational amplifiers (op-amps) used. As a result, CIFF such as [5] is used more than 
CRFF which needs a delay-free integrator in each resonator to insure the poles stay on 
the unit circuit [4]. However, as a kind of effective topology, CRFF also including 
other sigma-delta topologies which contain delay-free integrators ought not to be 
limited to theory. This paper suggests a feasible method to realize the CRFF 
structures. 

Our design goal is to finish high-resolution (Effective-Number-of-Bits ENOB>13- 
bit), medium-bandwidth (input signal bandwidth fb>200 kHz) and low power (Pow- 
er<lmW) sigma-delta modulators, so that high figure-of-merit (FOM) performance 
can be reached. 

This paper is organized in the following manner. Section 2 provides the system 
level considerations of the modulators. Section 3 describes circuit implement of the 
SC circuits. Section 4 shows the experiment results of the designed sigma-delta 
modulators. 


2 System Considerations 


2.1 Proposed Structures 


Take a 4th-order 1-bit CRFF SDM as shown in Fig.1 for example, and Fig.2 is the 
proposed structure. V and U denote the output and input of overall modulator, respec- 
tively. The key is to use half delay integrators instead of the delay-free ones so that 


Fig. 2. Block diagram of the proposed 4th-order 1-bit CRFF SDM. 
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rigorous timing requirements could be relaxed. For this purpose, the method of equi- 
valence transformations has been used to change all the integrators to half delay ones, 
and meanwhile, half delay elements have been introduced in the first and third inte- 
grator feed-forward paths. As a result, the signal (STF) and noise transfer function 
(NTF) of the CRFF would not be changed, and the timing could be relaxed as well. 


2.2 Coefficient Selection 


In order to satisfy our design goal, through behavior simulation and design margin 
consideration, a 4th-order single-loop 1-bit CRFF form was chosen as the proposed 
topology, since they balance high resolution, high speed and low circuit complexity 
effectively. 

Using the ‘delsig’ toolbox available on [6], an inverse Chebyshev NTF which has 
local feedback path has been derived in the discrete-time domain and the coefficients 
of the sigma-delta topology could also be produced. The order of the NTF has been 
chosen to 4, oversampling ratio (OSR) is 64 and the sampling frequency (fs) is 
31.25MHz. The maximum out-of-band gain of NTF Hinf = 1.5 has been chosen to 
balance resolution and loop stability. Dynamic-range scaling has also been performed 
to restrict integrators’ output swings to known and practical values. 

After simulation and scaling, the final scaled values used are shown in Table 1. 
Here, for the sake of using unit capacitors to ensure the capacitor ratios to be insensi- 
tive to fringing effects and processing-induced length and width variations in real 
circuit implement, rational approximations have been introduced as well [4]. With the 
coefficients, ideal CRFF could exhibit 16-bit resolution in system simulation using 
Matlab & Simulink. 


Table 1. Modulator’s Coefficients 


Integrator Resonator Feedback Feed-forward 
coefficients coefficients coefficients coefficients 
a=0.4005 — 2/5 g=0.0083 — 1/120 a=0.4005 — 2/5 fO=1 

c1=0.3929 — 2/5 f1=1.388 — 25/18 
c2=0.2799 — 7/25 f2=1.5804 — 79/50 
c3=0.2076 — 1/5 £3=1.1718 — 34/29 


f4=0.6274 — 19/30 


2.3 Non-idealities Simulation 


For the sake of finding out the requisite design specifications of building blocks, the 
proposed 4th-order 1-bit CRFF sigma-delta modulator first has been simulated using 
MATLAB environment with the method of [7]. Here, a -4.1245dB full-scale 64.85 
kHz sine wave has been used as input signal. After the behavior level simulation, the 
required amplifier DC-gain which determines the accuracy of charge transfer and 
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GBW that determines its operation speed for the first integrator is 35dB and 40MHz 
respectively, and its sampling capacitor is 4pF for 86.9dB SNDR, as seen in Fig.3. 


PSD of Sigma-Delta Modulator (detail) 
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Fig. 3. Output power spectrum with 8192 samples of real CRFF modulator modeled in Matlab 


3 Circuit Implementation 


The major building blocks of the modulator include SC integrators, 1-bit quantizer, 
1-bit feedback DAC, switches, clock bootstrap and clock generator. 


3.1 Integrator Design 


In the proposed modulator structure, all the integrators are implemented with half 
delay forms. This could be done by switched-RC integrators proposed in [8]. Howev- 
er, [3] shows us a more power-efficient method to realize half delay integrators. It 
suggests us to use inverters to substitute general op-amps. In fact, the op-amp is the 
most vital component for low-power sigma-delta modulator, since it dominates the 
performance and consumes most of power of the overall system. Thus, to find out a 
kind of high performance and low cost op-amp satisfying the requirement of sigma- 
delta modulator has become an important work for low-voltage low-power design, 
and using inverter as op-amp gives us a new way to this job. 

As shown in Fig. 4, the inverter-based half delay SC integrator using an auto- 
zeroing technique to cancel the offset and form a virtual ground for the integrator. The 
node connecting the bottom plate of sampling capacitor Cs could be considered as a 
virtual ground, and the charge should be transfer from Cs to integrator capacitor Ci as 
the conventional SC circuits. The capacitor Cc has be used to sample and hold the 
offset voltage causing by the one input terminal inverter. A common-mode feedback 
(CMFB) circuit has been used in this kind of pseudo-differential SC integrator, and 
gain of the CMFB is defined as the ratio of Cy,/Ci. 
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Fig. 4. Schematic of the proposed complete CRFF modulator. 


Table 2. Inverters’ Performance 


DC gain GBW Slew rate (SR) Power 
41.13dB 63.543MHz 51.35V/ps 8.425uW 


As for the inverter, a simple push-pull inverter is more suitable than the cascode 
one in a low power supply voltage Vdd, since a simple inverter could still obtain both 
high DC-gain and wide GBW with very low Vdd. Based on the PMOS and NMOS 
transistors’ threshold voltage of our technology, 0.6V power supply voltage should be 
chosen to insure class-C operation which is the most power efficiency and high per- 
formance for an inverter [3]. Table 2 shows the basic performance parameters with 
10pF load capacitor of the inverter used in the first integrator. Considering the non- 
idealities of other building blocks in the modulator that have not been modeled in the 
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above Matlab simulation, higher performance inverter has been used in the circuit 
level design. 


3.2 Other Building Blocks 


The modulators are controlled by two phase, non-overlapping clocks: one for the 
sampling phase and the other for the integrating phase. In order to reduce the effects 
of charge injection, delayed clocks (®1d and ®2d) are needed. Thus, an on-chip clock 
generator was designed to generate the above clocks. 

Since the Vdd is 0.6V, the input and output common-mode voltage is set to 0.3V. 
And then, the feedback reference voltages Vref+ and Vref- are chosen as the power 
supply rails Vdd and ground. 

A traditional clocked comparator which consumes dynamic power only, followed 
by latch has been used as the 1-bit quantizer. The 1-bit feedback DAC is implemented 
by four transmission gate switches which are controlled by the output voltage of over- 
all modulator. 

All switches are implemented by transmission gates and driven by a pair of invert 
clock signals. A simple bootstrap circuit [9] is used to raise the voltage level of the 
clock in order for turning on the switches completely. In addition, sharing sampling 
capacitors and redundant switches techniques are introduced in the circuits to save 
power and chip area. 

The complete schematic of the overall CRFF modulator is displayed in Fig. 4. 


4 Simulation Results 


This circuit has been realized using Cadence environment under smic 0.134m CMOS 
technology. Using MATLAB, FFT with hanning window has been used to calculate 
signal to noise and distortion ratio (SNDR) under 8192 samples. 

The output swings of each integrator in the CRFF sigma-delta modulator are 
shown in Fig. 5 (left). Here, a 64.85 kHz, -2dB to full scale (dBFS) sinusoidal input 
signal has been used. As shown, besides the output swing of the first integrator which 
is a +0.5V waveform, the rest ones present relative small output levels ranging from 
+0.2V to £0.3V voltage, all of that exhibit the low-distortion property. 

Fig. 5 (right) plots the SNDR versus input amplitude. As seen from that, with 64x 
OSR, the proposed modulator could finish a large dynamic range (DR). The characte- 
ristics of the overall 4th-order 1-bit CRFF SC sigma-delta modulator are summarized 
in Table 3. The FOM for measuring performance is defined as [3]: 


_— Power (1) 
~ 2-2ENOB . Bandwidth 


In Table 4, it is a comparison of the performance for several recent low-voltage low- 
power sigma-delta modulators. It is obvious that the proposed structure has high per- 
formance among recent designs. In addition, as shown, in general, low-power designs 
are used as ten kHz bandwidth range, while our design could be used as hundred kHz 
applications. 
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Fig. 5. Integrator output swings (left) and SNDR versus input amplitude of the CRFF (right) 
with 64.85 kHz, -2dBFS input signal. 


Table 3. Modulator’s Performance Summary 


Supply Technology fs OSR fb Power Peak ENOB DR  FOM 
Voltage SNDR 
0.6V smic 31.25MHz 64 245kHz 427.63uW 81.2dB 13.2bits 95dB 0.103 
0.13um pJ/conversion- 
CMOS step 


Table 4. Performance Comparison 


Paper Vdd [V] CMOS BW SNDR DR [dB] Power FOM 

[um] [kHz] [dB] [uW] [pJ/conv] 
Roh [1] 0.9 0.13 20 73 83 60 0.412 
Par [2] 0.7 0.18 25 95 100 870 0.378 
Cha [3] 0.7 0.18 20 81 85 36 0.098 
Su [5] 1 0.18 16 63.4 N/A 18.1 0.468 
Ahn [8] 0.6 0.35 20 81 82 1000 2.731 
Su [9] 1 0.18 20 61.96 66 42 1.025 
This 0.6 0.13 245 81.2 95 472.63 0.103 
work 


5 Conclusion 


A method to overcome tight timing requirements of CRFF sigma-delta modulators 
have been proposed in this paper. Class-C inverters have been used as the amplifiers 
in half delay integrators. Then, a 4th-order 1-bit CRFF modulator has been imple- 
mented under 0.13m CMOS technology with only 0.6V supply voltage. Compared 
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to the other low-power structures, this design achieves high FOM performance. It 
indicates that both power and performance are well optimized. 
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Abstract. With the rapid developing of electronics, communication and em- 
bedded system technologies, some specific applications of the Internet of 
Things have made a success in the military and civilian aspects. The application 
of the Internet of Things in Nilaparvata lugens and environmental online moni- 
toring is an effective solution for present wired sensor monitoring system, 
which has much more disadvantages, such as complex wiring, susceptible to in- 
terferences, poor reliability, poor real-time capability, difficult maintenance, 
heavy repair work load and so on. In this paper, a novel wireless sensor node in 
the Internet of Things is proposed to achieve the functions of automatic data 
collection, transmission and the real-time promulgation. The wireless sensor 
nodes include a temperature sensors, a Humidity Sensors, a module of data 
processing based on ARM, a wireless communication module, etc. The whole 
structure of system is given and the key technologies of system design are in- 
troduced. In additional, the design and its realizing method are described in the 
hardware aspect. The system has the advantage of simple structure, low-budget, 
low power consumption, etc. This makes the system applicable to Nilaparvata 
lugens and environmental online monitoring and has a strong practical value. 


Keywords: Nilaparvata lugens, the Internet of Things, farmland information, 
sensor nodes, hardware. 


1 Introduction 


The Internet of Things is an emerging information network and is described as a self- 
configuring wireless network of sensors whose purpose would be to interconnect all 
things [1]. The concept is attributed to the former Auto-ID Center, founded in 1999, 
based at the time at the Massachusetts Institute of Technology (MIT) [2]. The Internet 
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of Things is neither science fiction nor industry hype, but is based on solid technolo- 
gical advances and visions of network ubiquity. Its appearance enables the whole 
world to be intelligently perceived and more mutually connected. Any objects will be 
able to exchange information and actively process information according to prede- 
fined schemes. In this vision of the future, is it easy to imagine things that are able to 
transport themselves: e.g. sensors in an electronic jacket can collect information about 
changes in external temperature and the parameters of the jacket can be adjusted ac- 
cordingly. There will be fully automated supply networks, autonomous warehouses. 

Nilaparvata lugens is one of the major pests of rice; not only in China but in most 
rice growing countries in South and Southeast Asia. Since rice crops are continuously 
cultivated in tropical Asia, both nymphs and adults of Nilaparvata lugens damage rice 
plants through extensive feeding on them. Nilaparvata lugens also transmits viruses 
such as rice ragged stunt (RRSV) and rice grassy stunt (RGSV) [3], [4]. From 2005 to 
2006, more than 485,000 ha of rice production area in southern Vietnam were severe- 
ly affected by Nilaparvata lugens. It resulted in the loss of 828,000 tons of rice valued 
at US$120 million [5]. Thus, once the monitoring and prevention work is relaxed, it is 
very likely to cause serious damages in a large scale. Given the above cases, the tech- 
nology of the Internet of Things is used to monitor Nilaparvata lugens and environ- 
mental information. It provides an important basis to study the rules of the Nilaparva- 
ta lugens disaster and the key factors leading to the disaster. 


2 Wireless Sensor Network Architecture 


2.1 Wireless Sensor Network Structure 


The wireless farmland information acquisition system took distributed structure which 
is showed in Fig.1. 


Fig. 1. Sensor network architecture. 


The system consists of wireless sensor nodes, sink nodes, wireless bridge, and con- 
trol centre. The measure data transmit from the sensor nodes to the sink nodes using 
the ZigBee communication network. The data transmit from the sink nodes to the 
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server by network bridge. The sink nodes in this system act as gateway, it is the pro- 
tocol conversion used to transform a data package in ZigBee protocol to TCP/IP pro- 
tocol before transmitting and a data package in TCP/IP protocol to ZigBee protocol. 


2.2 Structure of Wireless Sensor Nodes 


In wireless sensor network, there are a lot of sensor nodes, which not only have hard- 
ware part, but also have software part. The hardware part consists of four hardware 
components of sensor modules, processor modules, wireless communication modules 
and power supply modules, while the software part is made up of the hardware ab- 
straction layer, the service layer and the application layer. The entire structure is 
shown in Figure 2(a). 
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Fig. 2. (a) Composition of wireless sensor nodes. (b) Protocol architecture of wireless sensor 
network. 


Power Management 


At the hardware layer, the sensor module is responsible for collecting information 
in sensor field. The processor module is responsible for coordinating the work of 
various parts of nodes. The wireless communication module is responsible for wire- 
less communication with other sensor nodes. The power technical module is respon- 
sible for providing the power required for sensor nodes. 

At the software layer, the hardware abstraction layer realizes abstraction on the 
hardware platform (modules of power supply, data acquisition, data processing and 
wireless communication), which hides the details of the specific platform’s hardware 
interface. The service layer includes communication services, sensor services, power 
management services and real-time kernel. The application layer is defined by users 
based on the needs of specific use, which uses the interface offered by the service 
layer to easily design top software. 


2.3 Wireless Sensor Network Protocol Stack 


Network architecture is a collection of protocol layering and network protocols in 
a network, which defines and describes the functions which a network and its 
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components should complete. The same as other networks, wireless sensor network 
also has its own protocol stack. Figure 2(b) shows the protocol architecture of wire- 
less sensor network, which is different from a traditional computer network [6], [7]. 
The application of different software at the application layer can achieve different 
application purposes of sensor network. The transport layer provides functions of 
error control and flow control. The network layer is responsible for routing the data 
offered by the transport layer to information collection nodes. The data link layer is 
mainly responsible for nodes’ access so as to reduce nodes’ transport conflicts. The 
physical layer is responsible for bit-stream transport. 


3 Hardware Design of Sensor Nodes 


3.1 The Overall Structure of Hardware 


The Internet of Things is a wireless sensor network where many sensor nodes are 
arranged in certain areas in a random way. Sensor nodes are the most important and 
basic components of the Internet of Things. Different sensor nodes have different 
function: e.g. humidity sensor nodes acquire humidity, image sensor nodes capture 
images, and temperature sensor nodes acquire temperature, etc. No matter what type 
of sensor nodes, in addition to the differences in sensor modules, the composition of 
other parts is almost the same. A sensor node is actually an embedded system which 
is consists of the microprocessor, memory, sensors, network interface, camera inter- 
face and power supply. The Figure 3 shows the structure. 


Application p 


Power 


Sensors 


Video 
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Fig. 3. The overall hardware design of sensor nodes in the Internet of Things. 


3.2 The Processor Module 


The processor module is the core of sensor nodes. It completes the system resource 
management, task scheduling, information exchange, command analyzing, 
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multi-channel sampling, and other tasks. The selection of processors is essential to the 
design of sensor nodes. In this project, the processor module selects NXP Semiconduc- 
tors's LPC2478 powered by the ARM7TDMI-S core, which is designed for a wide 
range of applications that require advanced communications. The LPC2478 microcon- 
troller incorporates an LCD controller, a 10/100 Ethernet Media Access Controller 
(MAC), a USB full-speed Device/Host/OTG Controller with 4 kB of endpoint RAM, 
four UARTs, two Controller Area Network (CAN) channels, an SPI interface, two Syn- 
chronous Serial Ports (SSP), three I2C interfaces, an I2S interface, 32-bit timers, a 10- 
bit ADC, a 10-bit DAC, two PWM units, and up to 160 fast GPIO lines. All of these 
features make the LPC2478 particularly suitable for the hardware’s requirements on a 
processor in this paper. 


3.3. Temperature Sensors 


The temperature sensors used in this project are high precision digital sensors 
DS18B20 designed by Dallas Company. The core functionality of the DS18B20 is its 
direct-to-digital temperature sensor and the resolution of the temperature sensor is 
user-configurable to 9, 10, 11, or 12 bits. The default resolution at power-up is 12-bit. 
The DS18B20 uses Maxim’s exclusive 1-Wire bus protocol that uses single signal 
lines to transmit not only the clock but also data. In additional, each DS18B20 has a 
unique 64-bit serial code. This feature allows multiple DS18B20s to function on the 
same 1-Wire bus. Thus, it is simple to use one microprocessor to control many 
DS18B20s distributed over a large area. Its circuit diagram is shown in Figure 4(a). 
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Fig. 4. (a) The circuit diagram of temperature sensors. (b) The circuit diagram of humidity 
sensors. 


3.4 Humidity Sensors 


The humidity sensor uses SHT11 produced by the Swiss Sensirion, which is Sensi- 
rion’s family of surface mountable relative humidity and temperature sensors. It wide- 
ly applied in HVAC, automobiles, consumer electronics, automatic control and other 
fields and the applied CMOSens technology guarantees excellent reliability and long 
term stability. The SHT11 has many characteristics, such as programmable 
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regulation of measurement resolution (8/12/14 bit data), a tiny foot print, 2-wire serial 
interface, low power consumption, etc. Its circuit diagram is shown in Figure 4(b). 


3.5 The Wireless Communication Module 


The wireless communication module uses Texas Instruments’s CC2430 which can 
meet the need of high performance and low power in 2.4 GHz IEEE 802.15.4 band 
based on ZigBee technology. The CC2430 is a true System-on-Chip (SoC) solution, 
which combines a high-performance 2.4GHz DSSS (Direct Sequence Spread Spec- 
trum) RF transceiver core and an industrial-grade compact 8051 controller. The en- 
hanced 8051 MCU has 128KB programmable flash memory, 8KB RAM and many 
other powerful features. The CC2430 is highly suited for systems where ultra low 
power consumption is required. Its working current loss is 27 mA. In receiving and 
transmitting mode, the current consumption is respectively less than 27 mA or 25 mA. 
Its circuit diagram is shown in Figure 5. 
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Fig. 5. The circuit diagram of wireless communication module 


3.6 The CMOS Camera Module 


In this project, the OV6620 CMOS Image sensors is used to capture image, which are 
single-chip video/imaging camera devices designed to provide a high level of functio- 
nality in a single, small-footprint package. The OV6620 incorporates sensor includes 
a 356 x 292 resolution image array capable of operating up to 60 frames per second 
image capture, an analog signal processor, dual 8-bit Analog-to-Digital converters, 
analog video multiplexer, digital data formatter and video port, SCCB interface and 
registers. Thereinto, the video output can be programmed to provide video output in 
4-bit/8-bit/16-bit digital formats. The SCCB interface provides access to all of the 
device's programmable internal registers including exposure control, gamma, gain, 
white balance, color matrix, windowing, and more. Its circuit diagram is shown in 


Figure 6. 
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Fig. 6. Schematic circuit diagram of CMOS camera module 


3.7. The Ethernet Interface 


This system uses ENC28J60 network chip as the Ethernet interface. The ENC28J60 
chip produced by Microchip is a highly integrated and fast Ethernet controller with an 
industry standard Serial Peripheral Interface (SPI). It is designed to interface directly 
with the SPI port available on many microcontrollers. The ENC28J60 is IEEE 802.3 
compatible Ethernet controller and integrates MAC and IOBASE-T PHY. It also 
provides an internal DMA module for fast data movement and hardware assisted 
checksum calculation for various network protocols. Communication with the host 
controller is implemented via an interrupt pin and the SPI interface with clock speeds 
up to 20 MHz. Its circuit diagram is shown in Figure 7. 
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Fig. 7. Schematic circuit diagram of Ethernet module 
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4 Conclusion 


It is the hot field of recent international research for the Internet of Things. In the 
other hand, recent advances in wireless communication, semiconductor, embedded 
system and sensor technology have made it possible to build low cost, low power 
consumption, high performance wireless sensor nodes. In this paper, the Internet of 
Things is used in the Nilaparvata lugens monitoring and warning system. The sensor 
nodes in the Nilaparvata lugens monitoring system based on the Internet of Things is 
also designed, which uses high performance arm processor as control core to realize 
the data collection, storage and wireless transmission, improving the system speed 
and transfer efficiency. This paper has introduced its overall structure, and described 
in detail the hardware design and implementation of the various components of sensor 
nodes. The future work is mainly reflected in the selection of CPU with lower costs, 
smaller size, and lower power consumption to replace LPC2478. Through the use of 
small-size components, the size of nodes can be further reduced. 
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Abstract. Current algorithms of the dynamic programming or longest increas- 
ing subsequence with the time complexity of O(n’) or O(mlogn) can only find 
one maximum non-crossing subset of nets even if there is more than one with 
the same length. In order to traverse all possible ones in the VLSI (Very Large 
Integration Circuits) wire routing, a dynamic programming algorithm is adopted 
and modified to calculate the length of maximum non-crossing subsets. For this 
purpose, an adjacent list is created for the traverse and a recursive function is 
used to output all maximum non-crossing subsets of nets. The effectiveness of 
the algorithm with a time complexity of O(n’) is illustrated through the theoret- 
ical analysis and experimental results of corresponding C++ program. 


Keywords: maximal non-crossing subsets; dynamic programming; adjacent 
list; recursive function; circuit wiring. 


1 Introduction 


A circuit consists of a set of modules and a set of nets. Each net specifies a subset of 
points, called terminals, on the boundary of the modules. The layout problem is to 
interconnect the modules as specified by the nets in terms of different technological 
design rules. Due to the complexity of the problem, VLSI layout design is typically 
decomposed into three phases: placement, global routing, and detailed routing. In the 
placement phase, circuit modules are geometrically positioned on a layout surface 
(chip). In the global routing phase, the routing region is partitioned into simple sub- 
regions, each called an elementary region, and global assignment of the wiring paths is 
determined for each net. In the detailed routing phase, detailed wirings of the individu- 
al routing regions are given. [1] The crossing distribution problem occurs before the 
detailed routing. It is observed that nets crossing each other are more difficult to route 
than those nets that do not cross. The layout of crossing nets must be realized in more 
than two layers, thus requiring a larger number of vias. [2] 

Documents [3, 4] and [5] studied the problem of maximum non-crossing subset 
(MNS) of nets using dynamic programming [6, 7] and longest common subsequence 
[8] respectively. They have the complexity of O(n’) and O(nlogn) respectively. How- 
ever, these algorithms can only find one maximum non-crossing subset of nets even if 
there is more than one with the same length. In order to traverse all possible ones in the 
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VLSI (Very Large Integration Circuits) wire routing, a modified dynamic program- 
ming algorithm is brought forward in this paper. The effectiveness of this algorithm 
whose time complexity is O(n’) is illuminated by the theoretical analysis and experi- 
mental results of corresponding C++ program. 

The remainder of the paper is organized as follows. Section 2 introduces the prob- 
lem of maximum non-crossing subsets of nets in the circuit wiring. Section 3 illustrates 
the core thoughts of the algorithm using dynamic programming and analyzes the time 
complexity of the algorithm. Section 4 is the C++ program implementation corres- 
ponding to the algorithm. Section 5 provides the experimental results of the C++ pro- 
gram corresponding to the algorithm. The last section concludes and gives the 
directions for future work in this field. 


2 Maximum Non-crossing Subsets of Nets 


A routing channel has n pins on the top side and n pins with a permutation C on the 
bottom side. Pin i on the top side of the channel is to be connected to pin C; on the 
bottom side,1< 7<n. The pair (i, C,) is called a net. In total, we have n nets that are to 
be connected or routed. Suppose that we have two or more routing layers, of which one 
is a preferred layer. For example, in the preferred layer it may be possible to use much 
thinner wires, or the resistance in the preferred layer may be considerably less than in 
other layers. Our task is to route as many nets as possible in the preferred layer. The 
remaining nets will be routed, at least partially, in the other layers. Since two nets can 
be routed in the same layer if they do not cross, our task is equivalent to finding a max- 
imum non-crossing subset (MNS) of the nets. Such a subset has the property that no 
two nets of the subset cross. Since net (i, C;) is completely specified by i, we may refer 
to this net as net i.[3] 

Consider the example in figure 1, the nets ( 1,8) and ( 2,7) ( or equivalently, the 
nets | and 2) cross and so cannot be routed in the same layer. The nets (1,8), (7,9), and 
(9,10) do not cross and so can be routed in the same layer. These three nets do not 
constitute a MNS as there is a larger subset of non-crossing nets. There are two MNS 
of the routing instance given in figure 1. They are four nets {(4,2),(5,5),(7,9),(9,10)} 
and {(3,4),(5,5),(7,9),(9, 10) }. 


L 


“C=[8,7,4,2,5,1,9,3,10,6] 


Fig. 1. A wiring instance. 


3 Algorithm of Dynamic Programming 


The algorithm outlined below finds MNS with dynamic programming efficiently, us- 
ing only arrays and an adjacent list [10]. It processes the sequence elements in order, 
maintaining the MNS found so far. 
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Array element dp/i] represents the length of LIS that ends on the position i. The 
state transition equation is dp[i/=max{ dp[j]+1 | 1 Sj<i, bottom[j]<bottom[i] }. Here 
position j is the possible predecessor of position i in LIS. 

Adjacent list with virtual source node S and target node T can be created based on 
the array dp[ ]. Variable length is the length of the longest increasing subsequence. 
Elements with dp[iJ=length and dp[i]=1 are linked to the source node S and target 
node T respectively. Predecessor elements are linked to their successor element. 

Depth-first search (DFS) retraces the adjacent list from the source node S to the 
target node T and then outputs MNS sequentially. 

The algorithm, then, proceeds as follows. 


input circuit wiring; 
FOr 22S 2 dae STE 
(for; Slay 25.) ee EL 
find possible predecessor j of i in LIS 
such that wire[j].bottom< wire[i].bottom; 
calculate length dp[i] of every wire in LIS; 
create an adjacent list of all LIS based on dp[ ]; 
recursively output all MNS using DFS. 


The time complexity of the algorithm is equivalent to that of calculating the length of 
LIS, and it is O(n’). 


4 C++ Implementation 


4.1 Design 


The methodology of top-down modular is adopted to design the program. Therefore, 
we considered the structure of the C++ program with three big modules: |. inputting 
the circuit wiring, 2. the central module to find the MNS using dynamic programming, 
3. outputting the result. 

The second module computes the length of the MNS and finds the MNS using the 
dynamic programming. This module is further divided into three sub modules: “LIS” 
module, “ConstructAdjacentList” module and “addEdge” module. The last module 
outputs all MNS implied in the adjacent list. This module is further divided into two 
sub modules: “OutputMNS” module and “DFS” module. A seventh module “Wel- 
come” that displays the function of the program is also desirable. While this module is 
not directly related to the problem at hand, the use of such a modular enhances the 
user-friendliness of the program. 


4.2 Program Plan 


In last section, we have already pointed out the need for seven program modules. A 
root (or main) module invokes five modules in the following sequence: “Welcome” 
module, “InputCircuitWiring” module, “LIS” module, “ConstructAdjacentList” mod- 
ule and “OutputMNS” module. “ConstructAdjacentList” and “OutputMNS” modules 
invoke “addEdge” and “DFS” modules respectively. 
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A C++ program is designed by following the modular structure in figure 2. Each 
program module is coded as a function. The root module is coded as the function 
“main”; “Welcome”, “InputCircuitWiring”’, “LIS”, “ConstructAdjacentList’, ‘“ad- 
dEdge”, “OutputMNS”, and “DFS” modules are implemented through different 
functions. 


main 


ConstruetA djacent 


Welcome Input@ireu it Wiring, LI 
List 


OutputMNS 


addHdac DFS 


Fig. 2. Modular structure. 


4.3 Program Development 


Function “Welcome” explains the function of the whole C++ program. Function “In- 
putCircuitWiring” informs the user that the input is expected as a permutation of pins 
on the bottom of the channel corresponding to the pins on the top. The size of the 
group of the pins is determined first, so the total number of pins is needed before an 
input begins. In our program, the input process is implemented by importing the input 
data from a text file called “input.txt”. The result is outputted to the text file 
“output.txt”. 

Struct wire consists of top side top and bottom side bottom. Bottom side is actually 
the wire permutation (i.e., C; corresponding to i). The following figure 3 details the 
functions of “LIS”, “addEdge” and “constructAdjacentList”. The first and last func- 
tions are invoked by main function. The function “addEdge” is invoked by function 
“constructAdjacentList’”. The function “addEdge” helps to construct adjacent list. 

The function “LIS()” calculates the length and implicitly helps to construct adja- 
cent list of LIS for the later output using dynamic programming. Integer length 
represents the length of LIS. Array element dp/i] represents the length of LIS that 
ends on the position i. The state transition equation is dp/i/=max{ dp[j]+/ | 1 Sj<q, 
bottom[j]<bottom[i] }. There are double loops in the function “LIS”. The inner loop 
finds the possible predecessor of dp/i]. The outer loop calculates the length of current 
wire in the LIS. When the whole loop terminates, the length of LIS (i.e., MNS) is 
achieved. 

The function “addEdge (int u, int v)” adds an edge (u, v). The function “Construc- 
tAdjacentList” constructs adjacent list for the later output via function “DFS”. At first, 
two added nodes, source node S and target node T are created. Then, the adjacent list is 
initialized. Next, there are loops of two layers. In the outer loop, the last and first ele- 
ments of LIS are linked to the source node S and target node T respectively. In the 
inner loop, predecessor elements are linked to their successor element. Array of length 
dp[ ] and array of struct wire wire/ ] are used in the condition of judgment. 
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int LIS(int n) //find LIS using dynamic programming, 0(N*2) 
< int length=6,i,j; 
//memset(dp,-1,sizeof(dp)); 
for(i=1 ji<=n ji++) 
rf int tmp=6; 
for(j=1 5j<i 5j+*) 
{  ##find possible predecessor of dp[i] 
if(wire[j]-bottom<wire[i].bottom && dp[j]>tmp) 
; tmp=dp[j]; 
dp[i]=tmp+1; 
if (dp[i]>length) //find maximal length of LIS 
length=dp[i]; 


} 
return length; 
} 
inline void addEdge(int u,int v) ffadd an edge (u,v) 


{g[u] -push_back(v) 5} 
void ConstructAdjacentList(int n,int len) //ConstructAdjacentList 
< $=6,T=n+1; 
int i,j; 
for(i=6 ji<=n+1 5i++) 
{g[i]-clear();} 
for(i=1 ji<=n ji++) 
< if(dp[i]J==len) //The last element of LIS is linked to the node S. 
addEdge(S,i); 
if (dp[i]==1) /#/The first element of LIS is linked to the node T. 
addEdge(i,T); 
for(j=1 5j<i 5j++) 
{if(dp[iJ==dp[j]+1 && wire[i].bottomwire[j].bottom) 
addEdge(i,j); 
} 


Fig. 3. LIS, addEdge and ConstructAdjacentList functions 


In figure 4, function “OutputMNS” initializes variable pathNum with zero first, then 
invokes the recursive function “DFS(S, 0, len, pathNum)”. Function “DFS(int u, int dep, 
int len, int& pathNum)” retraces from the source node S to the target node T using DFS, 
and then outputs MNS sequentially. Here the source node S is the successor of the last 
element and the target node T is the predecessor of the first element in the LIS. 

The details of other functions “Welcome”, “InputCircuitWiring” and “main” are 
omitted here. The effects of these functions will be illustrated in the next section. 


void DFS(int u,int dep,int len,int& pathNum) #/DFS outputs all MNS 
{ int i; 
if (dep==len+1) 
< printf("The path %d of MNS:\n",++pathNum) ; 
printf(“top\tbottom\n") ; 
for(i=dep-2 ;i>=6 ;i--) 
printf ("Sd\t%d\n" ,wire[path[i]].top,wire[path[i]]-bottom) ; 
} 
for(i=68 ;i<g[u]-size() ;i++) 
{ — path[dep]=g[u][i]; 
DFS(g[u][i],dep+1,len,pathNum) ; 


> 

void OutputMNS(int n,int len) 

{ int pathNum=6; 
printf("The length of HNS is %d.\n",len); 
printf("The following is all maximum non-crossing subsets of nets.\n"); 
DFS(S,6,len,pathNum) ; 


Fig. 4. DFS and OutputMNS functions 
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5 Experimental Results 


5.1 Example Illustration 


TABLE I shows the dynamic changes of some variables and arrays in the process of 
program running for the example in figure 1. Variable i represents the time of loop. It 
is the position of pins on the top side, too. Array element wire/i].bottom records the 
position of pins on the bottom side of the current wire. Array element dp/i] represents 
the length of LIS that ends on the position i. 


Table 1. Dynamic changes of some variables and arrays 


i 1} 2] 3] 4) 5| 6} 7] 8] 9] 10 
wire/i].bottom 6 
dp[i] 3 


AB 
tat 
L311 
ae 
[3/73/71 4) 
a 
1 
L814 -.8 
|40;— 5. 8 
LT) 


Fig. 5. Output of all maximum non-crossing subsets 


In order to elaborate more clearly, the above figure 5 is converted to its equivalent 
figure 6. Two paths marked with red color can be found starting from the source node 
S and ending at the end node T. They areS 9 10 99935347 TandS>1099 
> 5227 T. All maximum non-crossing subsets are achieved after the previous 
sequences are reversed. They are2 >5397> 10 and495397 10. 


{ NW NC N 
8/7/4/2/5/1/9/3|10/6 


Fig. 6. Output of all maximum non-crossing subsets 
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5.2. Final Output 


In figure 7 our program outputs length and all maximum non-crossing subsets of 
nets for the example in figure 1. The length of MNS is 4. Set {(3,4),(5,5),(7,9),(9, 10) } 
is a MNS of nets {(1,8),(2,7),(G,4),(4.2),(5,5),(6,1),(7,9),(8,3),(9, 10),(10,6)}. And 
{(4,2),(5,5),(7,9),(9,10)} is another MNS of nets in figure 1. 


TUTE 
This program finds all maximum non-crossing 
subsets of nets using dynamic programming. 
LLL 
Input the size of wires: 
Input the array of pins on the bottom side: 
The length of MNS is 4. 
The following is all maximum non-crossing subsets of nets. 
The path 1 of MNS:. 


top bottom. 
3. 4 

> 39 

y ee 

9 10 

The path 2 of MNS: 
top bottom 
4 2 

Ss 4 

7 #9 

9 10 


Fig. 7. Output of all maximum non-crossing subsets 


6 Conclusion 


In this paper, a modified dynamic programming algorithm with the complexity O(n’) 
to output all maximum non-crossing subsets of nets and to calculate its length in the 
circuit wiring was presented. Furthermore, the data structure of adjacent list is applied 
in the algorithm to improve the efficiency. 

The complexity of this algorithm is analyzed theoretically. In contrast to this, the 
previous algorithms of using either the dynamic programming or longest increasing 
subsequence can only find one maximum non-crossing subset of nets even if there is 
more than one with the same length. Further, a C++ program is developed to test this 
enhanced algorithm, and the satisfactory experimental results of the C++ program are 
presented. 
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Though some achievements about maximum non-crossing subsets of nets are made 


in this paper, more efficient algorithm of minimizing wiring congestion is our next 
research task. 
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Nonlinear Sample-Data System 
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Abstract. This paper deals with non-fragile 7. control with pole 
constraints for a class of nonlinear sample-data system. Firstly, the con- 
tinuous control plant of sampled-data system is transformed into an un- 
certain discrete system with bounded nonlinearities model. Then, the 
pole constraints theory and the linear matrix inequality approach are 
incorporated to design a non-fragile 71. controller against possible per- 
turbations, which results in the closed-loop system being D-stable and 
the system’s performance index being less than a prescribed scalar. Si- 
multaneously, the existence condition and the design approach of non- 
fragile controller are derived. Finally, simulation examples are presented 
to illustrate the feasibility of the proposed control algorithm. 


Keywords: Sampled-data system, Nonlinear system, Non-fragile con- 
trol, Pole constraints, Linear matrix inequality. 


1 Introduction 


Sampled-data control theory has been well-developed in the last two decades{I], 
and established methodology is available for analysis and synthesis, which is em- 
bodied by the following two aspects. 

On one hand, such control problem can usually be equal to designing proper 
controller such that the closed-loop system is asymptotically stable and its per- 
formance index is less than some prescribed scalar. However, most of these results 
are based on the accurate state feedback controllers|2)3/4)5]. In fact, due to the 
existence of the parameter drift, accuracy problem and other factors. the param- 
eters of the controller are possible to accrue gain variations. And relatively small 
perturbation of the controller parameters might destabilize the closed-loop sys- 
tem, even lead to the performance degradation. This is known as the non-fragile 
control problem[6]. So far this Problem has been widely investigated by many 
researchers . 
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On the other hand, it is desirable to construct sampled-data system to achieve 
better transient performance. To this aim, a more practical way is to put the 
closed-loop poles in a specified region ; 

In this paper, we will design the non-fragile H.. control with pole constraints 
for a class of nonlinear sample-data system. At the same time, the existence 
condition and design approach of non-fragile controller with pole constraints are 
presented. 


2 Preliminaries and Problem Statement 


Let the control plant of sampled-data system be given by 


&(t) = (Ao + AAo) x(t) + (Bo + ABo)u(t) + f(x, u, t) + Biw(t) (1) 
z(t) => Ci x(t) + Aw (t) 


where x(t) € R” is the state, and u(t) € R™ is the control input, z(t) € R! 
is measured output, w(t) € R? is the external disturbance input that belongs 
to L[0, oo], Ao, Bo, Bi, Ci, Hz are the constant matrices of appropriate di- 
mensions. f(z, u,¢) is the uncertain nonlinear function vector, and satisfies the 
Lipschitz condition with f(0,0,0) = 0. 


Assumption 1. The continuous plant is time-driven with a constant sampling 
period h(h > 0). 

Discretizing system (1) in one period, we can obtain the discrete state equation 
of the plant of sampled-data system 


= +1) = (Go + AGo)a(k) + (Hp + AHo)u(k) + f(x, u, k) + Hyw(k) (2) 


where 
Go = eAoh , Ho = ie eAo(h-“) du Bo, , m= fe eAo(h—w) diy By 


f(a,u,k) = i e4¥ dw f (x, u, t) 
AGo, AHp are uncertain matrix, and satisfy the following form 
[AGo AH] = MF(k)[Eo Ei] (3) 
In the sequel, we assume that the nonlinear uncertainty f(x, u, k) satisfies 
fT (a,u,k)f(a,u,k) < @7(k)QT, Quek) (4) 


where Q,; are known positive definite matrix. 
The objective of this paper is to design a non-fragile state feedback controller 


u(k) = (K+ AK)a(k) (5) 


Non-fragile 2. Control with Pole Constraints 589 


where, K is the nominal controller gain, and AK represents the gain perturba- 
tions. In general, there exist the following two types of perturbations: 
Type 1: the additive form 


AK = MF(k)E» (6) 
Type 2: the multiplicative form 
AK = MF(k)E3K (7) 


where M, E2 and E3 are known constant matrices, and F'(k) is uncertain pa- 
rameter matrix with satisfying FT(k)F(k) <I. 


Remark 1. The controller gain perturbations can result from the actuator degra- 
dations, as well as from the requirements for re-adjustment of controller gains 
during the controller implement stage[14]. These perturbations in the controller 
gains are modeled here as uncertain gains that are dependent on the uncertain 
parameters. The model of additive uncertainties [9] and multiplicative are 
used to describe the controller gain variations. 


Definition 1. A real matrix A is D-stable, i.e., has all its eigenvalues in the 
LMI region D,which can be denoted as o(A) C D, o(A) is spectral set of 
matrix A. 


Lemma 1. [15] Let Ac¢ R"*”be a given matrix. The eigenvalues of A belong 
to D(—a,r) if and only if there exists a symmetric matrix P € R"*” such that 


—p-! r(A+al) 


rie ap |? 


Lemma 2. [16] For given matrices Q = Q™, H, and E, with appropriate 
dimensions 
Q+HF()E+ ETF T(t)H™ <0 


holds for all F(k) satisfying FT(k)F(k) < I, if and only if there exists « > 0 
Qtenhn +e 8 Fu 


Lemma 3. [17] Let (G, H,C,D) be a minimal state space realization of system 
T(s), let 7 be a positive scalar. The system is stable and the inequality 


IT(S)lhoo SY 


is true, if and only if there exists a positive definite matrix P such that the 
following linear matrix inequality (LMI) holds: 


—-P+G™TPG GTPH cT 


H™PG -7I+HTPH DT| <0 
C D os 
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3 Main Result 


Theorem 1. Consider system(2), there exist scalars ¢; > 0,¢2 > 0 and matrices 
K, AK, X, = X? > 0, and Y; such that for all admissible uncertainties the 
following matrix inequality hold: 


—X, * * *k Ok * * * 
0 I * * ok * * * 
y31 —X1 * * ox * * * 
CiX, He 0 I x * . * 
? 0: we cea ae & 4 |S? (8) 
Yor 0 0 0 0 -exf * * 
0 0 eoMT He 0 0 0 -é.l x 
EoX, 0 0 0 0 0 QO —€ol 
—rX1 * * * * * 
Yai —rXy * * * * 
0 e3MT -esI x x . 
pat 0 O —é3l x * a (9) 
0 eaMT HE 0 0 -e4l x 
EyX 4 0 0 0) 0 —eql 


where 
Yai = 931 = GoX1 + QuX1 + AoY, 
par = Yor = fo X1 + £1Y, + Fo X1 


then u(k) = (K + AK )a(k) is a additive non-fragile H. control law with pole 
constraints of system (2) and CK =Y,X1 . 


Proof. system (2) can be transformed into the following form 


ee +1) = Ga(k) + Hyw(k) 


2(k) = Cya(k) + How(k) (10) 


where 


G=Go + Qi1+ AGo + HoK + Ho AK + AHoK + AHoAK (11) 


Owing to Lemma 3, one obtains 


—P x * Ox 
Orne oe oe 
CP ape (12) 
Ci Hp O TFT 
Substituting (9) into the left side of (10) yields 
0 000 
0 000 
W=Wi+ 61000 <0 (13) 


0 000 
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where 
| —P * * Ox 
0 yet «x 
Wi = —1 
Go + Qoo + HK H, —P™ * 
Ci Ay O TI 


Pi = AGo + Hy AK + AHoK + AHj AK 
From (12), and together with (3) and (6) yields 
W = W, + 10107 + €7 03 02 + £20303 + 67104 O4 (14) 


where a 
6:=[00MT0]’, 6: =[Eo+EoK + EK 000] 
63=[00MTH? 0)", 6 =[2000] 


From Lemma 2, (12) is equivalent to 


WW * * * * 
e107 —e,I* . * 

Oo 0 -e,l x * <0 (15) 
6203 0 0 eI x 


0, 0 0 0 —é9l 


Pre- and Post-multiplying (13) by diag{P~!, I, I, I, I, I, I, I} and its trans- 
pose respectively, one gets that (13) is equivalent to (7). 

On the other hand, the closed-loop system (8) must be D-stable, from 
Lemma 1, one obtains 


- T 
| vi Te | <0 (16) 
From Lemma 2 and Lemma 3, and together with (3) and (5) yields 


Wo * * * * 

6103 —e3I * * * 
6 0 —ée3l * * 

£207 0 0 -e,I x 
Og 0 0 O —-eql 


<0 (17) 


where 
—rP * 


WA Gs de Og EC SPA 


6;=[0MT]", 6¢=[Eot+ EiK + E20] 
67 = [0 MTHE]” , 63 = [E20] 


Pre- and Post-multiplying (13) by diag {P~1, I, I, I, I, I} and its transpose 
respectively, one gets that (16) is equivalent to (8). Thus, this completes the proof. 
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Theorem 2. Consider system(2), if there exist scalars ¢; > 0,¢€2 > 0 and 
matrices K, AK, X2 = XF > 0, and Y2 such that for all admissible uncertainties 
the following matrix inequality hold: 


—XQ * * kOe kK ok x 
0 71 * KOK Ok kx 
v31 Ay —X9 kk OK Ok Ok 
Ci X2 Ae 0 IT*« * * x 
00: Sev eer on te 8) 
Ye. 0 0 0 0 esl * x 
0 0 e6(HoM)™0 0 O eel * 
E3sY 0 0 00 0 O eg¢l 
yl * * * * * 
wa -y¥X2 a a 
0 eyMT -e7I x * * 
Wat 0 0 -e7l «x * <0 (19) 
0 egMT Ho 0 O -—égl x 
E3Y> 0 0 0 0 —égI 


where 


War = W31 = GoX2+ Qi1X2 + HoY2 
War = Ver = Eo Xo + LE, Yo + Ey X2 


then u(k) = (K + AK)x(k) is a multiplicative non-fragile H,, control law with 
pole constraints of system (2) and K = Y2X2 . 


Proof. Its proof is same with theorem1, so omitted. 


4 Numerical Example 


Consider the control plant of the sampled-data system with 


T 
—1 45 0 0.5 1 
Ao= | 0 ae Ao HE loa: Ci = | , M1, = Hz = 0.1 


2 3 0 0.1 0 —0.2 0 1 
pee ee ee Ee i a Fe 0.1 I. B.=|_5o| F=|01,| 


The disk D(-1.0,1.8) is given, according to Theorem1, solve the corresponding 
non-fragile with pole constraints problem via the solver feasp in LMI toolboxes, 
it yields 


0.4612 —0.7026 


X,= ee 0.7399 | , Yi = [0.2435 0.2001 | 


K = [-2.0450 —1.6890] 
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Fig. 1. The eigenvalues distribution under Fig. 2. The eigenvalues distribution under 


additive variation 


5 Conclusion 


multiplicative variation 


We have addressed the problem of non-fragile 7. control with pole constraints 
for a class of nonlinear sample-data system, and non-fragile controller is derived 
by solving a set of LMIs, which guarantees that the closed-loop system is D- 
stable and system’s performance index is less than a prescribed scalar. Finally, 
simulation example are presented to illustrate the feasibility of the proposed 


control algorithm. 
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A Dual-Band Tapered Slot Omni-directional Antenna 
with an Orthogonal Polygon Parasitic Element 
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Abstract. A dual-band tapered slot omni-directional antenna with an orthogonal 
polygon parasitic element which covers GSM/DCS/PCS/CDMA2000/WCDMA 
/TD-SCDMA/WLAN is proposed experimentally and numerically. The pro- 
posed antenna comprises of two shunted tapered slot antennas. The tapered slot 
antenna is an asymmetric structure which is made up of a straight line and an 
exponential line. A polygon parasitic element which improves the radiation pat- 
tern is orthogonal with the radiator. The measured 14-dB impedance bandwidth 
of the antenna is 29.5% and 54.6% at the lower band and the higher band, re- 
spectively. And the measured 10-dB impedance bandwidth is 129.2% 
(0.69GHz-3.21GHz) centered at 1.95GHz, which is about 46 times that of the 
corresponding monopolar wire-patch antenna. The antenna is successfully si- 
mulated, designed, and measured, showing dual-band impedance bandwidth, 
stable gain and good omni-directional radiation patterns. 


Keywords: Tapered slot antenna, wideband antenna, omni-directional antenna, 
indoor base station antenna, dual-band antenna. 


1 Introduction 


With the development of the third generation mobile communication, more and more 
buildings and malls require the indoor base station antenna mounted on the ceilings 
and districts. In fact, many monopole antennas and sleeve antennas had been used 
widely, while the omni-directional radiation pattern with the vertical polarization was 
obtained [1-3]. However, the proposed antennas have large dimensions, narrow 
bandwidth and complex structures. 

In order to obtain a low-profile and wideband antenna which has omni-directional 
radiation patterns, a monopolar wire-patch antenna has been reported [1]. For this 
antenna, the wire monopole is top-loaded with a square patch. Two shorting wires are 
used to connect the patch to the ground plane. Under such configuration, the antenna 
height can be reduced to at the center operating frequency, as the antenna is operated 
at a resonance under the fundamental cavity mode. Another monopolar antenna was 
investigated in [3], which had wide bandwidth and monopole-like radiation pattern. 
But the antenna also has large size, which is too large to mount on the ceilings. In [4], 
a wide-band monopolar wire-patch antenna with L-probe fed is proposed for indoor 
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base station applications. However, the impedance bandwidth of these antennas are 
narrow. In [5-6], two sleeve antennas are investigated, the experimental results show 
that impedance bandwidths of about 50% are achieved. So, it can’t cover 
GSM/DCS/PCS/ CDMA2000/WCDMA/TD-SCDMA/WLAN operation bands. 

In this paper, a dual-band tapered slot omni-directional antenna with an orthogonal 
polygon parasitic element is presented experimentally and numerically. By properly 
selecting the straight line, the exponential line, the ground plane, the height of the 
radiator, and the orthogonal parasitic element of the slot antenna, a dual band antenna 
with wide impedance bandwidth, small size, good radiation characteristics suitable for 
the GSM, CDMA, CDMA2000, WCDMA, TD-SCDMA, DCS, WLNA (2.4GHz- 
2.483GHz) base station applications could be obtained. Details of the antenna design 
and both the numerical and experimental results are presented and discussed. 


2 Antenna Design 


The basic geometry of the proposed antenna consists of two tapered slot antennas 
which are formed by combining the straight line and the exponential line, orthogonal 
parasitic element, the ground plane. The tapered slot was supported by FR4 with rela- 
tive permittivity 4.4 and thickness 1.6mm. The structure of the antenna is shown in 
Fig.1. The straight line and the exponential line can be described by equation (1) and 
equation (2), respectively. 


Radiator 


‘Top View 


Fig. 1. Geometry of the proposed antenna 
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z=k(x-A)-B (1) 


yee Bees 5 (2) 
Where, x is the horizontal ordinate and z is the height of the antenna. In this design, k, 
A, B, C, p, D, E are constant. And E is zero. From the Fig.1, the proposed antenna 
with an orthogonal polygon parasitic element is fixed on a ground with diameter D 
like a monopole antenna. And the two tapered slot antennas which are modified Vi- 
valdi antennas are shunted to obtain good omni-directional radiation patterns. The 
orthogonal polygon parasitic element utilized in the paper is similar to the radiator, 
only the exponential line was replaced by a tapered structure. The parasitic element is 
also orthogonal with the radiator to improve the radiation characteristics effectively 
[7]. However, very different from the conventional tapered slot antenna, the proposed 
antenna combined two tapered slot antenna which is shunted to form an omni- 
directional antenna. The asymmetrical structure also gives a good omni-directional 
radiation pattern which can well meet the wireless and mobile communication appli- 
cations. For achieving the dual-band and wide-band operation, the dimensional para- 
meters of the proposed antenna, were all first iteratively approached from the High 
Frequency Structure Simulator (HFSS) and then adjusted from experiment. Finally, 
the optimal antenna dimensions were obtained and shown in Table 1. 


Table 1. Dimension of the proposed antenna(in mm) 


Paramters Value 
L 56 
Ll 45 
L2 43 
L3 43 
L4 56 
LS 56 
L6 9 
g 1.8 
H1 10 
H2 44 
H3 85 
H4 32 
H5 48 
H6 94 
D 120 
D3 39 


3 Parameters Study 


As the proposed idea above, the key parameters were obtained by using HFSS. Due to 
the diameter of the ground D and the distance of the slot g play an important role in 
the impedance of the antenna. So, the two parameters are selected in the parametric 
study. In order to obtain accurate influence of the two parameters on its impedance 
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S, (48) 
S, (dB) 
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Fig. 2. Effect on the reflection coefficient of the key parameters vs. frequency 


bandwidth, only one parameter is changed at each time while other parameters are 
kept constant which is listed in table 1. 

The effects of the two key parameters mentioned above on the return loss vs. fre- 
quency are plotted in Fig.2 (a) and (b), respectively. From the Fig.2 (a), it can be seen 
that the resonant frequency of the both band is moving to the higher frequency with 
the distance between the straight line and the exponential line g increased, whereas 
the resonant frequency will reduce by decreasing the distance g. The distance plays an 
important role in improving the impedance bandwidth and the resonant frequency. 
Fig. 2(b) gives that the effects of the diameter of the ground D. The resonant frequen- 
cy at the lower band has little change, but the resonant frequency at the higher band 
changes rapidly. In order to broaden the impedance bandwidth of the antenna, the 
radiator and the orthogonal parasitic element are also cut as a tapered structure. The 
proposed antennas with orthogonal polygon parasitic element and without orthogonal 
polygon parasitic element are realized by HFSS. The effect on the impedance band- 
width of the orthogonal polygon parasitic element is shown in Fig.3. From Fig.3, the 
orthogonal parasitic element has little effect on the impedance. But the orthogonal 
parasitic element can improve the omni-directional radiation pattern which is realized 
in [7]. So, the antenna can be optimized by adjusting the distance g, the ground plane 
D and the tapered slot antenna structure. 


—#— without parasitic element 
- @ -with parasitic element 
- + r T r 


T ¥ T T 1 
1.0 15 20 25 3.0 36 
Frequency(GHz) 


Fig. 3. reflection coefficient of the proposed antenna with and without parasitic 
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4 Results and Discussions 


The proposed antenna is optimized by HFSS and the optimized antenna is manufac- 
tured. The prototype of the proposed dual-band tapered slot antenna with an ortho- 
gonal parasitic element is shown in Fig.4. To evaluate the performance of the 
optimized antenna, the proposed antenna was tested. The detailed dimensions are 
listed in table 1. The size of proposed antenna is reduced by 33.3%, which is smaller 
than the previous structure in [1-3] and more suitable for the indoor base station ap- 
plication installed on ceilings and suburban area. In this section, the simulated and 
measured return loss, Gain and the measured radiation pattern of the proposed anten- 
na are presented and discussed. The simulations are given by using HFSS, and the 
measurement is achieved by using HP8757D network analyzer and an anechoic 
chamber. Fig.5 shows the simulated and measured return losses of the antenna. It can 
be seen from the Fig.5, the measured result is well agree with the simulated one with 
an acceptable discrepancy. The differences between the simulated and measured val- 
ues may be due to the errors of the manufactured antenna and the N-type connector to 
the feeding probe, which is included in the measurements but not taken into account 
in the calculated results. For return loss less than -14dB, the measured dual-band 
impedance width are about 29.5% (0.75GHz-1.01GHz) centered at 0.88GHz and 
54.6% (1.57GHz- 2.75GHz) centered at 2.16GHz. And the measured 10-dB imped- 
ance bandwidth is 129.2% (0.69GHz-3.21GHz) centered at 1.93GHz, which is about 
45 times that of the corresponding monopolar wire-patch antenna [2]. The measured 
result also meets the GSM, CDMA, CDMA2000, WCDMA, TD-SCDMA, DCS, 
WLAN (2.4GHz-2.483GHz) applications. 


—#— Simulated 
—*— measured 


15 20 25 
Frequency(GHz) 


Fig. 4. Prototype of the proposed antenna; _‘ Fig. 5. Reflection coefficient of the antenna 


Fig.6 gives the radiation pattern of the proposed antenna at 0.824GHz, 0.96GHz, 
1.71GHz, 2.17GHz, 2.4GHz, respectively. It can be seen from the Fig. 6, the antenna 
has good omni-directional radiation pattern among the impedance bandwidth in the 
H-plane. In the E-plane, the radiation pattern of the antenna has monopole-like, which 
is bidirectional radiation pattern. But, the radiation pattern at the higher band has little 
change which may caused by the feed cable placed in the near field of the antenna and 
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the asymmetric tapered slot structure. Fig.7 shows the simulated and measured gains 
at the major point. In the lower band, the gain is more than 2dBi which is similar to 
the monopole antenna. In the higher band, the gain is more than 5dBi, which is raised 
by 2dBi compared to the antenna in [1-3]. It can be well fulfill the indoor base station 
applications. 
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Fig. 6. The radiation patterns of the antenna. 
(a)0.824GHz; (b)0.96GHz; (c) 1.71GHz; (d)2.17GHz; (e)2.4GHz 
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Fig. 7. Gain of the antenna 


5 Conclusions 


In the paper, a dual-band tapered slot omni-directional antenna consisted by two 
shunted tapered slot antenna with an orthogonal parasitic element is realized experi- 
mentally and numerically. The size of the antenna is reduced about 33.3% and the 
gain is 2dBi higher than the previous similar antenna. From the experimental results, 
return loss less than 14-dB, the measured dual-band impedance bandwidth is about 
29.5% at lower band and 54.6% at higher band. And the measured 10-dB impedance 
bandwidth is 129.2% at the center frequency. Owing to the results, the antenna can be 
applied for indoor base station communication applications. 
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Abstract. Both manifold learning and semi-supervised learning have been 
widely investigated in the past few years. Some of manifold learning algo- 
rithms, which are based on the idea of local approximation, can be used to 
control the way of transmitting information between point clouds. We combine 
local approximation with the idea of preserving projections and weighted inte- 
gration, and give a set of solutions to semi-supervised regression and manifold 
alignment. Finally, we validated the effectiveness of the presented schemes in 
the experiments. 


Keywords: semi-supervised learning, manifold learning, classification, mani- 
fold alignment. 


1 Introduction 


Many high-dimensional data in real-world applications can be modeled as data points 
lying close to a low-dimensional nonlinear manifold. Manifold learning algorithms 
aim at recovering the embedded low-dimensional manifold to study the primary prop- 
erty of the high-dimensional data. Most of them are based on preserving some metric 
info of the sample space. For example, Isometric Mapping(ISOMAP) [1] holds global 
geodesic distance; Diffusion Maps [2] preserves a kind of global metric defined by 
gauss kernel function; Hessian Eigenmaps Locally Linear Embedding(HLLE) [3] 
maintains that the low-dimensional data representation is locally isometric. Some 
algorithms are based on the idea of local approximation, such as Locally Linear Em- 
bedding(LLE)[4], Laplacian Eigenmaps (LE)[5] and Local Tangent Space Align- 
ment(LTSA)[6]. Some others are modified from local approximation algorithms, and 
preserve projections in the transformation of high-dimensional data to low- 
dimensional data. For example, Linearity Preserving Projection(LPP)[7], Neighbor- 
hood Preserving Embedding(NPE) [8], and Lineal Local Tangent Space Alignment 
(LLTSA)[9] preserve a global linear projection; Locally Linear Coordination(LLC) 
[10~14] preserves local linear projections between every local dimension reduction 
coordinates and final coordinates. We combine local approximation with the idea of 
preserving projections and weighted integration, and give a set of solutions to semi- 
supervised regression and manifold alignment. Here, classification is thought of as a 
special case of regression. 
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The rest of this paper is organized as follows: Section 2 introduces the local ap- 
proximation. Following in section 3 applications of Local approximation is devel- 
oped for local approximation algorithms. We give an analysis and experiments of 
SLTSA and others on classification in Section 4. Finally, the conclusions are given 
in Section 5. 


2 Local Approximation 


Suppose we have a Dxn matrix X consisting of n datavectors x, with dimensionali- 


ty D , we need to transform it into a new dataset Y with dimensionality d , while pre- 
serving the main property of X . 

Local approximation is extracting some local approximation property from X to 
organize Y . There are two types of approximation, approximation of a point, and 
approximation of a block, and the former is a special case of the other one. 
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Fig. 1. Local approximation (a) Local approximation for point (b) Local approximation for 
block 
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LetT, be a vector of indices of points in the (k—1) -neighbor of x, ,T; = F . 1 is 


: ; : : 1 : 
the identify matrix, e is a vector of all 1’s, J=(U “ee” is a mean removal operator, 


X, =X,J is the local coordinate of the high-dimensional data and its counterpart is 


Y. =¥,J Given coefficient w,,,, the approximation to a point is ¥,,w,,; > Y,. 
2.1 Point Approximation 


= i 
Let T, be a vector of indices of points in the (k —1) -neighbor of x,, and I, -| isa 


vector including i and I,. S, is a 0-1 selection matrix satisfying XS, = X= , similarly 


T, 


in the low-dimensional space, we have YS, = Y. . Let e be the vector of all I's, 7, be 


: : : : 1 f 
the identify matrix with rank k. Then, J =(/ ae) is a mean removal operator. 
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We can get the local coordinate of the high-dimensional data x = XJ, and its 
counterpart in low-dimension Ye =¥_J. The approximation of a point is defined as 


ey w, > Y,, where w, is the local approximation vector, which is extracted from X . 


The point approximation error of y, is defined as 


1 
154) | 
—w, 


by summing the approximation error of each point, the total approximation error of 
points is 
1 
YS,J 
—w, 


where B, is a sparse matrix satisfying BOT, i)= i). 
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Wi 


Frobenius norm of a matrix. 


2.2. Block Approximation 


Similar to the definition of point approximation, the block approximation is defined as 
Y.W, >Y., where W, is the local approximation matrix around y, . 


The block approximation error around y, is defined as 


a a 
F, ~Yewif, =[s.7(0, Wi . 


err, = 


The summation of the approximation error of all local blocks is 


: (4) 


ern, = Dern, = > |¥S,J (1, -W, |, =|¥S,8, 
i=l i=l 


where S, =[5,,---,S,], B, = diag{J(I, -W,)--.J(Z, —W, )}- 


n 


3 Applications of Local Approximation 


Learning with the label info can be regarded[8] as the problem of approximating a 
multivariate function from labeled data points. The function can be real valued as in 
regression or binary valued as in classification. Learning with the label info can also 
be regarded as a special case of dimension reduction that maps all the data points in 


the label space. The label error of y, is defined as err, =s,|ly,- fil , where 


F =|f,,---.f, ]is the label value, s, is the flag to identify the labeled points satisfying 
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5,= i _ is : , and L is the collection of index of labeled points. By weighted 
ue oe 
combining the point approximation error and label error, we can get 
u 2 
Err, =>. (1-4, Perr, +a; ern, )= [v,(7,- Af. +I - Fal, (5) 
i=l 
optimal 
y* = FAAT(M, +Aa’)" (6) 


Here, M, =B,(,-A)U,—A)’ By, 4, = ca —a°)+a°)s, is the weight coefficient at y,, 
n 


p? 


1 is the number of labeled points, a° is the minimal weight coefficient set by user, 


and A=diag(a,;). We set the weight coefficient on the following intuition that: if the 


proportion of the labeled points is very small, we have to reduce our dependence on 
the knowledge only retained from the labeled points; if all the points are labeled, we 
must totally discard the geometry info of the point clouds, for at that moment the label 
info is more reliable. As a result, the coefficient 4 has to be adjusted with the propor- 
tion of labeled points. 

Similarly with the point approximation, the total error defined for block approxi- 
mation is 


Err, = > (a- a,) erty, + a, ern; )= |Ys,B, (I, —A, II. + \(v = FAI (7) 


i=l 


optimal 
y" = FAA(M,+Aa")' (8) 
Here, M, =S,B,U, -A,)U, —4,)’ BUS) , K=kxn, A, =diag{al,,---,a,1,} is a sparse 


weight matrix. We take y° =——}’y, as the decision threshold for classification. 


igL 


4 Experiment and Discussion 


In this section, we investigate the performance of our proposed Local approximation 
(SLP)method for face representation and recognition. The system performance is 
compared with the Eigenface method Neighborhood preserving embedding (NPE), 
the Linear local tangent space alignment (LLTSA), and the Laplacianface method 
(LPP), three of the most popular linear methods in face recognition. We use the same 
graph structures in the Laplacianface and Local approximation (LP)method, which is 
built based on the label information. 

Three experiments were conducted to evaluate the performance of the SLP algo- 
rithms. The nearest neighbor algorithm was used to evaluate the recognition 
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technique. Each dataset was truncated into two subsets, one as a training set and the 
other as a learning set. The classification rates are means of all tests on each dataset. 
It should be noted that, since the focus in this paper is on feature representation, all 
of our experiments use a very simple classifier, i.e. nearest-neighbor classifier. In this 
experiment, the Duck dataset are randomly partitioned into two subsets, namely. 
Samples are shown in Fig. 1, which shows our experiment datasets. Duck dataset 
is extracted from COIL20. We use duck and face datasets and MNIST dataset for 
classification. 
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Fig. 3. DUCK recognition accuracy 
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Fig. 4. Facial recognition accuracy 
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Fig. 5. MNIST recognition accuracy 
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In the experiments, SLP (Semi-supervised local coordinate) is compared with other 
three inductive methods: LPP, NPE and LLTSA. Here, LPP is performed in semi- 
supervised manner. NPE and LLTSA are semi-supervised extended with prior infor- 
mation. The final results on all data sets are separately listed in figure 3, 4 and 5, and 
the best performance is shown in bold. The result in figure 3 shows that SLC outper- 
forms other methods on DUCK in all cases. figure 4 demonstrates the superiority of 
SLC and LPP on FACE. In figure 5, the classification accuracy reveals the benefits of 
both SLC and LLTSA. From these results, we can see that, SLP is a relatively robust 
classification algorithm, and achieves good performance on all the data sets. 


5 Conclusion 


An local approximation (SLC) algorithm is presented in this paper. It combines local 
approximation with the idea of preserving projections and weighted integration, and 
give a set of solutions to semi-supervised regression and manifold alignment for clas- 
sification. This algorithm presented greatly reduces the number of labeled data the 
classifier system needs in order to achieve satisfactory performance. Experiments on 
standard database show that our algorithm performs better than current classifier 
combination rules when considering both labeling cost and classification accuracy.. 
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Abstract. Conventional semi-supervised learning leverages the unlabeled data 
by intensive exploring the pairwise relation among the data points. However, it 
is well known that such relation cannot capture the complex interaction in many 
real-world applications. To address this problem, we proposed in this paper a 
new approach to effectively modeling the labeled and unlabeled data by local 
tangent space alignment, which is superior due to the properties of invariant to 
shift and scale. We apply the local tangent space alignment to semi-supervised 
learning tasks including semi-supervised classification. The experiments 
compared with state-of-the-art semi-supervised learning methods demonstrated 
the effectiveness of the proposed approach. 


Keywords: semi-supervised learning, manifold learning, local tangent space 
alignment. 


1 Introduction 


Recently, technology for dimension reduction has attracted much attention in pattern 
recognition, usually raw data taken with capturing devices are multidimensional and 
therefore are not very suitable for accurate classification. To obtain compact 
representations of raw data, some techniques about dimension reduction have come 
forth. From the geometrical point of view, dimension reduction can be considered as 
discovering a low-dimensional embedding of high-dimensional data assumed to lie on 
a manifold. The key of dimension reduction is to preserve the underlying local 
geometrical information of raw high-dimensional data while reducing insignificant 
dimensions. However, if the original data lie on a nonlinear manifold in nature, 
traditional dimension reduction methods such as Principal Component Analysis 
(PCA) will fail to well preserve its geometrical information in a low-dimensional 
space while unfolding the nonlinear manifold. That is, in the case of nonlinear 
manifolds, PCA often maps close points in the original space into distant points in the 
embedded space. In the recent years, a number of techniques have been proposed to 
perform nonlinear mappings, such as MDS [1], locally linear embedding (LLE) [2], 
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Laplacian Eigenmaps (LE) [3], manifold charting [4], Hessian-based locally linear 
embedding (HLLE) [3], Modified locally linear embedding (MLLE) [6~9], Laplacian 
Eigenmap (LE) [6] and ISOMAP [5]. One basic idea of local methods is to regard a 
small neighborhood of the manifold as a linear one and find a local geometry around 
each data point, and then use the collected local geometric information to nonlinearly 
map the manifold to a lower dimensional space. All of these are problematic in 
application in some way: firstly, multi-dimensional scaling and neural networks are 
hard to train and time-consuming. Mixtures of localized linear models require the user 
to set a number of parameters, which are highly specific to each data set and 
determine how well the model fits the data. Secondly, these nonlinear methods aim to 
preserve local structures in small neighborhoods and successfully derive the intrinsic 
features of nonlinear manifolds. Recently, Zhang and Zha [10] proposed a fine 
method: local tangent space alignment (LTSA). The research of nonlinear 
dimensionality reduction is based on that many high-dimensional data is confirmed by 
several hidden variables, such as the effect of face image sampling is determined by 
brightness, the distance between person and camera, head pose, facial muscles and so 
on. From the perspective of cognitive psychology, Psychologists think that cognitive 
processes are based on topological continuity. Thus nonlinear method is feasible. This 
paper introduces the principle of locally linear embedding, applies it to face 
recognition, and analyses the recognition rate. Comprehensive comparisons and 
extensive experiments show that the approach achieves much higher recognition rates 
than a few competing methods. 

Face recognition and fingerprint identification technology have been applied widely. 
Because of different light and expression, every person has many different face 
images. However, gaining effective and reasonable low dimension face image is much 
more difficult from high dimension face by keeping whole face information. It is a 
imperative solved problem. Consequently, potential framework of high dimensional 
data should be discovered through studying the low dimensional character embedding 
the high dimensional space, then we can recognize face efficient. 

The rest of the paper is organized as follows. We outline the basic steps of LTSA 
and illustrate its failure modes using two examples in Section 2. The SLTSA will be 
proposed in Section 3. We give an analysis and experiments of SLTSA and others on 
classification in Section 4. Finally, the conclusions are given in Section 5. 


2 Local Tangent Space Alignment 


We first outline the basic steps of LTSA. The basic idea of LTSA is to construct local 
linear approximations of the manifold in the form of a collection of overlapping 
approximate tangent spaces at each sample point, and then align those tangent spaces 
to obtain a global parametrization of the manifold. Details and derivation of the 
algorithm can be found in [10]. Given a data set X; = [x,4--5%)] with x,<¢ R” , sampled 


Semi-supervised Local Tangent Space Alignment 613 


(possibly with noise) from a d-dimensional manifold (d << m), x, = f(¢,)+é,, where 


f :Qc R4 > R”, Q is an open connected subset, and ¢, represents noise. LTSA 
assumes that d is known and proceeds in the following steps. 


(1) Local neighborhood construction. For each x, , i=1,...,N , determine a set 
X,;= Lx, ,...,%;, | Of its neighbors (k nearest neighbors, for example). 


(2) Local linear fitting. Compute the optimal rank-d approximation to the centered 


: = —- lr ‘ : : 
matrix (X, —x,e"), where x, = qui , and e is a k-dimensional row vector of all 
= J 


1’s. By the SVD of X,—xe", we can obtain the orthonormal basis Q, for the d- 
dimensional tangent space of the manifold at x,, and the orthogonal projection of each 


i 


x,, in its neighborhood to the computed tangent space 6,” = Q/ (x, —%). 


i 


(3) Local coordinates alignment. Align the N local projection 0, = [2”....,4| ; 


= 1..4N , to obtain the global coordinates Denote 1,,...,7, , and Tale kG: 


which consists of a subset of the columns of T with the index set {i,,...,i,} determined 


by the neighbors of each x,. Let E, =T,-c,e’ —L,O, be the local reconstruction error 


Eb 2. 


‘ 1 1 ‘ 
matrix, where eae and L, = TI ~— ee" )@; =T,0; , where ©; is the Moor- 


Penrose generalized inverse of and e is a vector of all ones. Then the alignment of 
LTSA is achieved by minimizing the following global reconstruction error: 


E(T) = rel = Diminfy ~cet -1,,| =|rsw||. (1) 
Where S =[S,,...,Sy] and W =diag(W,,...,Wy), with 
1 Li +—Q+ 
W, =U-see \I-@f0*) (2) 


To uniquely determine T , we will impose the constraints TT’ = /, , it turns out that 


the vector e of all ones is an eigenvector of B corresponding to a zero eigenvalue, 
B=SWW'S" (3) 


Therefore, the optimal T is given by the d eigenvectors of the matrix B , 
corresponding to the 2nd to d+1st smallest eigenvalues of B 

From the basic steps of LTSA, one can see that recovering the real local tangent 
space is the key issue that relates to whether LTSA can discover the true manifold 
structure faithfully. In the presence of noise, however, the recovered tangent space by 
standard SVD technique will deviate from the real one due to its sensitivity to noise, 
then it will further influence embedding result of LTSA. 
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When computing local alignment error, LTSA gives each point the same weight in 
equation (1). Obviously, in noise case, we should make distinction between clean 
points and noise points, and this can be implemented by specifying different weights 
to them. In addition, there is no need to minimize the local align error sum of all 
neighborhoods in equation (3). On the one hand, in no noise case localtangent space 
coordinates derived from each neighborhood can character local geometric well, they 
are heavily redundant. On the other hand, in noise case, forcedly aligning the local 
coordinates of neighborhoods that are not well approximated will result in fatal error 
in the final embedding. Therefore, one should discard the neighborhoods whose local 
coordinates can’t character the local geometric well due to dominant effect of noise, 
and select neighborhoods that are approximated well by the local tangent space 
coordinates, then minimize the alignment error sum of these selected neighborhoods 
in equation (3). 


3 Semi Supervised Local Tangent Space Alignment 


LTSA offers us a method to determine the approximation matrix W, for the block 
mode. Here @, is the mapping of Xz, in the local tangent space, 6° is the Moor- 


Penrose generalized inverse of @,, and W, = 6"@, acts like a correlation matrix of the 
points around x, . 


Learning with the label value can be regarded[8] as the problem of approximating a 
multivariate function from labeled data points. The function can be real valued as in 
regression or binary valued as in classification. Learning with the label value can also 
be regarded as a special case of dimension reduction that maps all the data points into 


the label value space. The label error of y,is defined as err, = s\| y,-f, * where gs, 1S 


1 + ie L 
O -- i¢L 
of indices of labeled points, and F =[f,,---,f,]is the given label value. The loss 


the flag to identify the labeled points satisfying s, -| , Lis the collection 
function Err, (Y) defined on weighted combination of point approximation error and 
label error and its optimal solution Y* are shown in (3). 


Err, = 3 ( a,) err, + a, ern, )= |v, (1, = All. + Iv - F)Al. (4) 


i=l 


Then 


y’ = FAA"(M, +Aa’)' (5) 
where M, =B,(1,-A)(1,-A) B,’ , a, = La-a°)+a")s, is the weight coefficient at 
n 


y,, Lis the number of labeled points, a° is the minimal weight coefficient set by user, 
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and A=diag(a,) . Here, the setting of the weight coefficient a, is based on the 


following two assumptions that: if the proportion of the labeled points decreases, we 
have to reduce our dependence on the knowledge only retained from the labeled 
points; if all the points are labeled, we must totally discard the geometric knowledge 
of the point clouds, for the label information is more reliable and the geometric 
knowledge is completely useless at that moment. 

Similarly, the total error defined for block approximation and its optimal solution 
are 


Err, = Xl —a,) err, +4, ern; )= |vs,B, (1, —A, I. +|\(v - F)Al (6) 
Optimal 
y" = FAA"(m,+Aa’)' (7) 
Here, 
M, =S,B, (Ig — A, (ly —A,) B'S," (8) 
K =kxn, A, =diag{a,I,,---,a,1,}is a sparse weight matrix. We take y° = — Pa! as 
—l ier 


the decision threshold for classification. 


4 Experiment 


Two experiments were conducted to evaluate the performance of the SLTSA 
algorithms. The nearest neighbor algorithm was used to evaluate the recognition 
technique. Each dataset was truncated into two subsets, one as a training set and the 
other as a learning set. The classification rates are means of all tests on each dataset. It 
should be noted that, since the focus in this paper is on feature representation, all of 
our experiments use a very simple classifier, i.e. nearest-neighbor classifier. 

In this experiment, the ORL data sets are randomly partitioned into two subsets, 
namely, 200 training images and 200 test images, with no overlap. Samples are shown 
in Fig. 1. 


Fig. 1. Some original ORL faces 
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Fig. 2. Facial recognition accuracy 


In this experiment, the training sets are randomly selected from the training set 
with 1000 images of each digit, while the test sets are all images of digits in the 
testing set. 


Fig. 3. Digit samples in MNIST data set. 


The handwriting digits vary based on the habits of each person. The digits have 
been size-normalized and centered in 28X28 pixels gray-scale images. In this 
experiment each algorithm is performed on the data set ten times. The mean 
accuracies are shown in Table 1. Here we also choose K-NN as the classifier, and 
K=15. The feature space is set to 
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Fig. 4. MNIST recognition accuracy 


However, our method still has some shortcomings. The method just gives an idea 
for semi-supervised manifold learning. First it introduces an additional parameter; 
second it ignores the statistical feature, which has been used in Charting a Manifold. 
From the theoretical analysis we can obviously find the effective parameters to noisy 
learning; thus the next research will focus on using probability to reduce the noise 
affection according to the effect of these parameters. Nowadays Jenkins, et al 
proposed an average method which can greatly reduce the incidence of face 
recognition errors. This method stimulate us to research a new manifold learning 
method to find texture manifold and structure manifold, which can increase the 
robustness applied in pattern recognition of manifold learning methods against noise 
disturbance. 


5 Conclusion 


In this paper, we proposed a novel semi-supervised method based on Local Tangent 
Space alignment. Different from some manifold learning algorithms that preserve the 
global or local metric knowledge of the high-dimensional data, local approximation is 
invariant to shift and scale, also, It utilizes the unlabeled data by intensive exploring 
the pairwise relation among the data points, and offers us a useful approach to model 
the labeled and unlabeled data in semi-supervised learning tasks. There are some 
problems to be solved for future work: 1.Choice of coefficients: find new 
approximation coefficients to precisely describe the relationship between point 
clouds; 2.find new way of computing inter-data to better maintain the global or local 
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property of high-dimensional data; 3.Nonlinear function: most of functions in this 
paper are linear, that are represented by matrix. We can replace them by nonlinear 
function to achieve better performance. We could perfect the algorithm and find better 
method to improve the efficiency in the future work. 
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Abstract. In the paper, different types of energy generated by the underwater 
plasma discharging are analyzed and calculated. Together with data of the dis- 
charge voltage, discharge current, shock wave signal and the bubble wave, the 
method for calculating the resistance of the channel is deduced, based on the 
conversation of energy. With the method, a time-varying resistance model R(t) 
about the discharge channel is built up (Eq11) and some simulations were done. 
Moreover, the discharge circuit loop equation is deduced based on the proposed 
time-varying resistance model. By using the finite difference method, the dis- 
charge voltage and current are computed (Eq14-15). The simulation results are in 
a good agreement with the experimental result (Fig.4). The result in this paper 
provides guidance for the research of the discharging channel characteristics, the 
design of its circuit, the controlling of the energy conversion and the enhance- 
ment of the efficiency of the electricity to sound. 


Keywords: underwater plasma acoustic source, discharge channel and circuit 
characteristics, simulation and experiment, time-varying resistance model. 


1 Introduction 


The impulse sound wave generated by the underwater plasma acoustic source, in virtue 
of underwater high-voltage discharge[1], has many advantages, such as high instan- 
taneous emitting power, narrow wave width, wide frequency bandwidth, fast reaction 
and easy to focus or control [2]. Since these characteristics, this technology has been 
used widely in industry, medical treatment and so on, like lithotrities[3], removing the 
dirty in pipeline[4], food sanitation and sewage treatment[5], etc. In present years, this 
kind of source also have been applied in ocean geology reconnoitering[7], underwater 
targets detection[8], wideband acoustic interference[9] and sound power weapon[10]. 
Although the acoustic source based on the underwater plasma discharge has the said 
advantages and wide application, the transient and randomicity of the discharge process 
make the physical phenomenon very complex and the mechanism of electric-sound and 
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the characteristics of the discharge channel not explicit enough. Therefore, exact cal- 
culating the resistance of the discharge channel will be helpful to the reaches for the 
mastery of the discharge circuit features. It also can give a guidance for enhancing the 
discharge efficiency and decreasing the energy dissipate in the discharge channel[1 1]. 

In this paper, a model about the resistance of the channel was built based on the 
plasma channel energy balance equation. With the help of the model and the finite 
difference method, the discharge current and discharge voltage during the underwater 
high-voltage pulse discharge were simulated. Simulation results and experimental 
results were in good agreement. 


2 Conversation of Energy Equation 


If water is stressed with a high voltage pulse having a rise time of tens of nanoseconds, 
stream develop and multi conductive channel forms between the electrodes. The in- 
tense Joule heating of the plasma in the channel results in the temperature in the 
channel up to 10*K[12]-[15]. At the same time, the shock wave is generated together 
with radiation. Then bubble will be formed between the electrodes and emits bubble 
wave because of its fast expanding and collapsing. When plasma discharging under- 
water, many types of energy are produced concluding[16]: the internal energy inside 
the channel, the radiation energy, the shockwave and the babble energy. The energy 
assumed by the light and electromagnetic radiation can be ignored. Thus, the rela- 
tionship among the different energy is 


Wais =W,, +Wq + W, +W, (1) 


Where W,, is the internal energy inside the channel, W,,, the radiation energy, W, the 


shock wave energy, W, the bubble wave energy, W,,;, the energy transfused into the 
channel and can be expressed by Eq2 


Walt) =f POR(at Q) 


Here R(t) is the resistance of the channel, i(t) is the discharge current and ¢ is the 
duration of the channel. Associating with the Eq! and Eq?, it can be seen that, while the 
different energy at the right hand of the Eq! and the discharge current are known, the 
resistance of the channel can be got. 


3 Analyze and Computation of the Energy in Discharge Channel 


3.1 The Internal Energy 


The temperatures in the plasma channel between charging period of the two electrodes 
can reach in excess of 10000 K. At this temperature, water vapor no longer behaves as 
an ideal gas with constant specific heat; which rather dissociates into hydrogen and 
oxygen. Ionization of hydrogen and oxygen can begin at temperatures of this order as 
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well. For easy calculation, assuming the gas, including water molecules and different 
particle generated by the dissociation and ionization of the water molecules, are at a 
balance state[17]. Based on Landau-Lifshitz law, the particle or ion inside the charging 
channel can react in the following 5 forms (Eq3). 


@,:2H,0 © 2H, +0, [+4.96eV] 


a, :H, 2H [+ 4.48ev] 
a;:0, 20 [+5.1leV] (3) 
@,:HOH* +e [+13.6eV 


a,:0 00% +e [+13.6eV] 

The datum inside the bracket is the minimum energy for the corresponding reaction. @, 
is the reacting rate of the corresponding reaction, with 0 < @, <1, where 0 depicts non 
reaction, | depicts reaction finish. Therefore the energy added inside the channel is 


W, =D, (4) 

Where E, is the energy added during each reaction in Eq.3, can be calculated by Eq5. 
1 

E,=>N Ac, kT + €;) (5) 


Here, 1S f/<8,15i7<5, ee is the freedom of the molecule or the atom, k is 


Boltzmann constant, € 


»; 18 the ground state energy of the each atom, T is the tem- 


perature in the channel, N,, is the number of different particle. If the initial number of 
the water molecule is N, after dissociation and ionization, N gun iN .Where 1 ‘ is the 


function of @, which is gotten from Equation (3). 


3.2. The Radiation Energy 


Thermal radiation, which is strongly dependent on the temperature of the system, can 
be an important contribution to the energy partition of the underwater spark. In a naive 
model of the gas globe as a perfect blackbody radiator, thermal radiation represents a 
power loss proportional to the fourth power of the temperature as described by the 
Stefan—Boltzmann law, 


W,,, =4aR°S-t (6) 


Where, R is the radius of the discharge channel. S is the electromagnetic flux normal to 
the surface of the channel. The flux at the surface of the discharge channel is the inte- 
gration over photon frequencies 


S= [sav (7) 


S\, is the flux at special photon frequency V. 
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3.3. The Shock Wave Energy and Bubble Wave Energy 


When underwater plasma discharging, both shock wave and bubble wave are gener- 
ated. At time domain, the shock wave generated while the plasma channel formed is 
before the bubble. Although the shock wave and bubble wave are two kinds of energy, 
they can be calculated with the same formula[18] 


An - r aT 
W,,= [' P(yar (8) 
> pC 0 
Where, 7 is the distance between the sensor and the source, p is the density of the 


water, C is the sound speed, 1, is the signal’s width, P(t) is the signal. 


4 The Model and Computation of the Resistance in the Discharging 
Channel 


4.1 Simulation of the Resistance in the Discharging Channel 


With the experimental data of the shock wave, bubble wave and the discharge current, 
the model is simulated. The parameters used in the simulation are as follows: 
temperature of the discharge channel is 30000K; the temperature inside the bubble 
is 4000K. @, in equation (3) are @, =0.9, @, =0.3, @3 =0.3, A, =0.01, a; =0.01 


respectively. 
For easy simulation, the derivative of the Eq2 about t is 
“(\2 dW i; 
i(t}’ R(t) = —_—* (9) 
(x)= 
Where 
dW 
Was 4 (y, +w,, +W,+W,) (10) 
dt dt 


Combining Eq3~10, the resistance of the discharge channel is deduced and the curves 
are drawn in Fig.l. From which, it can be seen that, before the formation of the dis- 
charge channel, the two electrodes can be considered as open circuit. Because of the 
discharge, the resistance of the channel changes from a few Ohms at the beginning of 
the discharge to only a few milliohms at the metaphase. At the end of the discharge, the 
resistance will increase. 

In Fig.2, resistances inside the discharging channel are the result from reference, 
curve 1, [19], curve 2 [20]. Curve | was gotten with method of calculation of the 
discharge channel conductance. Curve 2 was a experimental result. Comparing Figures 
1 and 2, a good agreement between the results of the references with that of the simu- 
lation of this paper will be found. 
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Fig. 1. Resistance waveform during the discharge 
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Fig. 2. Resistance curve in the discharge channel (Ref. 19-20) 


4.2 Time-Varying Model of the Discharge Channel 


In fact, the resistance during the discharging can be from kQ, at the begin of dis- 


charging, to mQ, at the end of the discharging. The equivalent resistance R(t) can be 
denoted with the exponential function 


2 
tta 


ees *] +R, aap) 


Where Ro is the equivalent resistance of the electrodes after the channel formation. For 
the case of tap water (resistance rate3.42 Qm), the distance and the top radius between 


the electrodes 2mm and 0.5mm respectively, Ro-7.5 kQ A, a, b can be got from the 
Fig.2 and the experiments. 


5 The Simulation and Experiments of the Discharge Circuit 


5.1 The Model of the Discharge Circuit and Its Simulation 


The equivalent circuit diagram of the discharge circuit is shown in fig.3, where, K is the 
switch and L is the inherent inductance. 
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= R(t) 


cana 


Fig. 3. Equivalent circuit diagram of the discharge circuit 


Eq.12 is the loop equation of the discharge circuit. 


pio, R(il)+ + fildar=0 42) 
The derivative of Eq12 about t is 
d’i(t) di(t) [ dR(t) 1 ) 
L + Rit + + t)=0 13 
eg a ol = 


The Eq13 is solved with the finite difference method and the discharge current is ex- 
pressed in Eq1/4: 

At? 

i(n) 2L+2R(n)At— ArR(n+1)-——| 


ea) L+ R(n)At ~ L+R(njat 


The initial conditions of the Eq14 are i(0) = 0 and i(1) = 10 pA. The discharge voltage is 
u(n) = i(n)- R(n) (15) 


Based on Eq14, Eq15 with some certain initial conditions (according to the setup and 
the discharge), the normalized discharge current and voltage waveforms are shown in 
Fig.4. The reason of the oscillatory of the discharge current and voltage waveforms is: 
after the plasma channel formed, the equivalent resistance of the electrodes is at the 
level of mQ, at this condition, the type of the discharge will convert to underdamped 
oscillations decay mode, just like the Fig.4. 


5.2 Comparison between the Experiment and Simulation 


Experiment are done inside a 20x7x8m water pool with noise elimination. The 
charging capacity is 100uF. The charging voltage is 6kv.The charging electrodes is a 
coaxial cable, which in the depth of 2m. The shock wave and the bubble wave are 
collected with PCB pressure sensor, resolution 6.95MPa/V. The discharging current is 
measured with a Rogowski coil with 50kA/V; high-voltage detector is TEK6105A. 

In order to test the validity of the method to calculate the discharge voltage and 
current which is based on the exponential model of the resistance, some simulation 
have been done at the same condition with the experiment. Both simulation result and 
experimental result are plotted in Fig.5. From which we can find a good agreement 
between the results. 
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(b) Discharge voltage 


Fig. 4. Comparison of the simulation and the experiment results 


6 Conclusion 


(1) The main energy generated during the underwater discharge are systematically 
analyzed and computed. Based on the measurement of the discharge current, the 
resistance of the discharge channel has been calculated due to the conservation of 
energy. From the comparing the simulation results to other references, a good 
agreement are found and the validity of the method is confirmed. 

(2) The models of time-varying resistance and the discharge circuit are built up and 
some simulations have been done. The simulation results have a good agreement 
with the discharge voltage and current measured. 

(3) A model for describing the discharging circuit is set up based on the proposed 
time-varying channel resistance. The results simulated with the model and the fi- 
nite differential method are in good agreement with that measured. 

(4) The simulation and experiment indicate that the main electric energy is converted 
to the internal energy when discharging, which heats the water and make the dis- 
sociation and ionization happen. Only 4%-8% energy converts to sound wave. In 
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order to enhance this efficiency, maybe some salt can be put into the water or a fine 
wire is set between the electrodes, which makes the formation of the discharge 
channel become easily. 
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Abstract. The article illustrated that there are four factors work on the core 
competence of industrial clusters. Then the writer designed a model of evaluation 
index system, and used fuzzy synthetic evaluation to analyze the core compe- 
tence of industrial clusters quantitatively and to identify the problems. So we can 
put forward corresponding countermeasures and measures to enable the rapid 
development of industrial clusters, at the same time promoting the regional 
economy and the country's macroeconomic development. 


Keywords: industrial clusters, core competence, fuzzy synthetic evaluation. 


1 Introduction 


Industrial clusters are playing a more and more significant role in regional economic 
development, both academic and government departments pay high attention to the 
phenomenon of industrial clusters. Many countries and regions have developed in- 
dustrial clusters research programs to identify and implement cluster development 
strategies. Through the interactive cooperation and exchange, enterprises of cluster can 
bring economies of scope and economies of scale into play, and can also have a strong 
spillover effect, so as to promote economic development of a particular region and the 
whole nation. Industrial clusters have become a worldwide economic phenomenon. 
Many scholars and experts carried out researches and put forward many theories. Such 
as the external economic theory of Marshall (1920) [1] and Krugman(1991) [2];the 
Regional aggregation economic theory of Weber (1909), Hoover (1948) and Barton; 
the regional production complexes and growth pole theory of Korosovski and Perroux; 
the diamond model /new competition economic theory of Porter (1998); regional 
economic dynamics/new industrial space theory of Scott, Storper [3], Harrison and 
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Waler. And the transaction cost theory of Williamson and the economic network of 
Harrison (Harrison) in recent years. From the 1990s, Chinese scholars have joined 
the research ranks, and advanced the theory of innovation spaces (Wang Jici, 1994), the 
new industrial district scale structure, contacts, embeddedness (Li Xiaojian, 1997), the 
regional economy, the informal system (Qiu Baoxin, 1999), business networks and 
entrepreneurial networks (Guangdong scholars), social network (Taiwan scholars) and 
innovation networks (Gai Wenqi, 2002). These ideas and theories explained and 
demonstrated the law of the existence, developing and changing of industrial clusters 
from different levels and different points of view. However, how did the core compe- 
tence of industrial clusters form? How to evaluate the core competence of the industrial 
cluster qualitatively and quantitatively? The series issues of how to enhance the core 
competence of industrial clusters of China still need further study and explanation. 


2 The Core Competence of Industrial Clusters 


C. K. PrahaladandG. Hamel 1990) published an article “The Core Competence 
of the Corporation” in the Harvard Business Review, raised the original notion of the 
core competence of the corporation for the first time. They considered that core com- 
petence is “the accumulation of knowledge of the organization, particularly the 
knowledge of how to coordinate different production skills and how to combine a 
variety of skills organically"! In their opinion, due to the difference of the core com- 
petence, the corporations in the same industry have different performances. The core 
competence is the key dominance for the corporation to survive and develop in the 
fierce market. 

As one of the important backbones to promote regional and national economies, 
industrial clusters should also strengthen its core competence continually in the back- 
ground of production and competition globalization. The core competence of industrial 
clusters formed in long-term development of the clusters, so that clusters maintain 
long-term stable competitive advantage, and obtain the most basic competence for 
long-term sustainable development. This competence builds on the foundation of the 
clusters’ core resources, embodied by the comprehensive competitiveness beyond of 
other competitors in production, culture, innovation, marketing, and so on. 

The core competence of industrial clusters should satisfy the following three as- 
pects: Firstly, embeddedness, the clusters’ core competence is accumulated by unique 
methods during the practice in the long term, which rooted in the corporate group and 
which is difficult to mimic or create for the outside competitors; secondly, scalability, 
the core competence of industrial clusters is not limited to a particular enterprise or 
product, but content in a variety of business areas within the cluster; thirdly, periodical, 
the clusters’ core competence is a performance that the cluster gains a dominant posi- 
tion in the market competition in a certain period. 

So, while many capabilities have contributed to the generation of competitive ad- 
vantage, but only the core competence can create a sustainable dynamic competitive 
advantage. Industrial clusters have low-cost flexible manufacturing system, excellent 
competitiveness of collaborative culture, dynamic mechanism of continuous innova- 
tion and marketing competitiveness. 
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3 The Elements and Evaluation Index System of Industrial 
Clusters’ Core Competence 


3.1 Low-Cost Flexible Manufacturing Competitiveness 


In product manufacturing, because the industrial enterprises within the cluster are 
geographically close to each other, so that they can find a variety of upstream and 
downstream industry resources in the region nearby, and can form vertical integration 
of corporate chain within the industrial cluster rapidly through the professional division 
of labor and collaboration. Saving transaction costs, reducing the search range and 
information search cost and shortening the matching path, the close geographical lo- 
cation and frequent interaction help enterprises to rapidly launch new products and 
quickly capture the market and form industrial scale in a short time. Concerning the 
competition of modern enterprises, the advantages of time and speed becomes more 
and more obviously. Enterprises and supporting cooperative ones that can produce new 
products to meet market demand fast, often can take high monopoly profits because of 
its first-mover advantage in competition. 

The author use following index to evaluate low-cost flexible manufacturing com- 
petitiveness: the rate of adoption of advanced equipment; average labor productivity; 
average capital input-output ratio; human Resources Recruitment average annual cost; 
ability to adapt to market changes. 


3.2 The Competitiveness of Collaborative Culture 


Collaborative culture is the deep-level factor during the formation of the clusters’ core 
competence, which is difficult for competitors to imitate. 

On the one hand, The formation of cluster’s collaborative culture, stems from the 
basis of cluster formation——“trust and cooperation”.[6] The long-term cooperative 
relationships between enterprises within the cluster do not need to maintain completely 
by the contract, but carry out through commitment and trust. Established in the cor- 
porate culture, the divisions of labor and cooperation relations are very stable and able 
to generate synergies, which cannot be obtained by the individual enterprise outside the 
cluster. So the enterprises within the cluster have a unique competitive advantage when 
faced with external competitors. 

On the other hand, the formation of cluster’s collaborative culture stems from the 
cluster talents mechanism. Under the pressure of competition, cluster enterprises attach 
importance to human resources development, especially the high-tech human resources 
development, and the mechanism to attract and stimulate talent, and form their own 
unique products. Thus, the products produced by the cluster corporations are of the same 
type, but vary in quality, style or otherwise. The clusters convert the competition among 
clusters’ enterprises to a synergistic competition by achieving product differentiation. 

These indicators are used to evaluate the competitiveness of the cluster collaborative 
culture: degree of division of labor; the average compliance rate of cluster enterprises; 
cluster cohesion; communication degree; insight and innovation in entrepreneurs. 


630 Y. Liu and Z. Hu 


3.3 Continuous Innovation Competitiveness 


The enterprises, universities, research institutions within the cluster locate closing to 
each other, which is conducive to the rapid diffusion and sharing of technology. We 
know the most basic model of technology diffusion is neighbor-proliferation: from the 
center to the edges, decaying with the distance. And the geographic proximity of the 
cluster enterprises is particularly conducive to enterprise employees carry out a range 
of formal and informal exchanges and promote the dissemination of tacit knowledge, 
and access to knowledge spillovers[4]. Geographical concentration will also help sti- 
mulating the companies to win the race peer effects. Within the regional cluster, 
comparisons between enterprises continue to promote the constant improvement of 
business management and accelerate technological innovation. The presence of the 
high intensity and constant comparison, coupled with easy access to innovation re- 
sources and the pursuit of the leading aspire of value, making the innovation conti- 
nually producing in cluster region and quickly imitated by other companies within the 
cluster, the value of innovation and rapid implementation and testing, thus the innova- 
tion cycles continue to be shortened and the innovation cycle is accelerating. 

These part indicators list as follows: the rate of adoption of advanced technology; the 
ratio of new products; R&D investment rate; Rate of technology developers; R&D 
input-output ratio. 


3.4 Marketing Competitiveness 


Marketing competitiveness of industrial clusters mainly assumes as a region brand and 
marketing innovation system [7]. On the one hand, it is easy to establish for the in- 
dustrial cluster to built a region brand. Compared to a single corporate brand, region 
brand (such as West Lake Longjing tea, Italian fashion) is a more visual image, has a 
broader, continuing brand effect. The more product brand name within the cluster, the 
bigger cluster force is, and the more likely to attract a variety of resources. It is con- 
ducive to the cluster companies gain market share, and create a competitive cluster. On 
the other hand, it is easy for the cluster enterprises to establish marketing innovation 
system. Cluster enterprises can establish a marketing innovation system, which built 
market development as the core and customer’s demand as the center. In this system, 
the marketing innovation training is strengthened, the quality of personnel is improved, 
and the marketing innovation is promoted, thereby strengthening the cluster market 
competitiveness. 


4 Fuzzy Comprehensive Evaluation Model of Industrial Clusters’ 
Core Competence 


Many indicators of industrial clusters’ core competence cannot be quantified with the 
specific value, can only be qualitative assessment of its extent, so the author used 
two-lay fuzzy comprehensive evaluation method to evaluate [8] [9]. Based on fuzzy 
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comprehensive evaluation and the core competence of the cluster evaluation index 
system, the author established the following evaluation model. 


4.1 Single Factor Evaluation 


4.1.1 Establish Factor Set 
We call factors that influence the core competence of industrial clusters as a factor set. 
It is acommon collection, use U to stand for: 


US gd SU 4 a) 
Of which: 


U, Indicate that the industry cluster competitiveness of low-cost flexible 
manufacturing 

U,, Indicate that the industry cluster competitiveness of the cultural synergy 

U, Indicate that the industry cluster competitiveness of continuous innovation 

U,, Said the marketing industry cluster competitiveness 


Considering U,, U,, U,, U4 , each have 5 influence factors, so 
U,= tA § Ais Ags Aygo Ags) i 1,2,3,4) 


4.1.2 Establish Weight Set 

In general, the importance of each factor is different. Therefore, to reflect the degree of 
the importance of each factor, each factor should be assigned a certain weight, and U 
establish a corresponding set of weights A: 


A={a, ,47,43, Weoley ,a,} 
And meet >? a; =1 
i=l 


Suppose we use the AHP method or random survey by the weight of Uis: A, G= 
1,2,3,4). 


Aji = (4j1,+4)2>4;3,4i4} 


4.1.3, Establishment Evaluation Set 
This evaluation set to evaluate a variety of evaluation objects may make the composi- 


tion of the total set of evaluation results. With V stand for: 


The core competence of industrial clusters evaluation, the evaluation set can be set: 


V= {V¥1..¥2,V3,V4.V5} 


Which Vv, = very strong; V, = strong; V,= moderate; V,= weak; V; = very weak. 
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Obtained by methods such as Delphi, the evaluation results matrix of U, is C a 


Ci Ci C3 Cig CIS 
C21 C22 C23 CM C25 
Cj =| C31 C32 C33. C34_—C35 
C41 C42 C43 C44 C45 
C51 C52 C53. «C54 C55 


i 1,2,3,4,53 1,2, 3,4, 5) 


j=l 
Construction U, single factor comparison matrix R, 


Al oa: AR. Md OS 
Mm 122 3 «na t5 
Ri=/, 2 30 a 35 
Tay Tan T4344 M45 
i ee 7 a 


Similarly, comparison matrix can be constructed R, : R, ,R as 


Order U , evaluation vector B, there are: 
B, = AjOR; = (bj, ,b)2.5;3,0;4, 5:5} 


Therefore, it results: 


B, = A,oB, 

By = A, 0B, 
B3 = A30B, 
By = A,oBy, 


4.2 Two-Lay Judges 


For the first-lay index set U ={U,,U,,U,,U,} , the weight assigned 


toA, = {ai ,@;554;3,4;,} , Make two-lay evaluation matrix as: 
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For the two-lay evaluation, have B=AoR= {b, 5 b, ‘ b, - b, ‘ b; } 


According to the principle of maximum membership, if b; the most, it shows that 
low-cost flexible manufacturing competitiveness, cluster collaborative cultural com- 
petition, continuous innovation, marketing competitiveness are strong, the core com- 
petitiveness is also strong; if bs; is the maximum, then its core competence is weak. 


5 Conclusion 


The strength of industrial clusters’ core competence is important to industrial clusters 
development in the long-term, and also important to China's entire economic devel- 
opment process. According to the evaluation of the cluster's core competence, imple- 
menting targeted strategies and taking corresponding measures to promote industrial 
clusters developing rapidly and healthily has been a priority. Using the fuzzy com- 
prehensive evaluation method for evaluating indicator system model can comprehen- 
sively evaluate the cluster's core competence and help to identify the problems. We can 
put forward corresponding countermeasures and measures to enable the rapid devel- 
opment of industrial clusters, at the same time promoting the regional economy and the 
country's macroeconomic development. 
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Abstract. In recent years, the wireless communication networks shows a high- 
speed, broadband and general development trend, including the rapid develop- 
ment of wireless technology represented by WLAN, WiFi, WiMAX, WSN, 
Mesh, 3G and B3G. As 3G is a present heat technology, the research on how to 
achieve the integration of 3G networks with WLAN and WiMAX networks is 
full of practical value. This paper just discusses and researches about the issue, 
and proposes appropriate solutions. 


Keywords: heterogeneous networks, Integration, WCDMA ,WLAN. 


1 Introduction 


Recent years, WLAN, Adhoc networks, WSN, WiFi, WiMAX and many types of 
wireless networks technology were generally researched, simultaneously, the mobile 
communication technology has experienced rapid development from 1G (AMPS, 
TACS) to 2G (GSM, CDMAOne) until the present heat 3G (WCDMA, CDMA2000, 
TDSCDMA) just in a short decade. So far, the global wireless communication system 
turns out the trend of broadband mobile and mobile broadband. Under the present hete- 
rogeneous networks environment, there are types of wireless technology exists and 
which are relatively independent, lacking of an effective coordinate system. This may 
cause problems of system disturbance, frequency resource scarcity, networks seamless 
handoff and etc unsolved. So it is an urgent issue how to realize the effective integra- 
tion of these heterogeneous networks, especially the 3G networks with present wireless 
technology [1]. 


2 The Integration of WLAN and 3G 


WLAN is the wireless LAN standards made by IEEE. At present, serial standards 
802.11 is widely used, frequency in the 2.4 GHz range, which has an advantage of 
cheap price, flexible networks, high-speed wireless data access supporting, unrestricted 
band and etc. The expanding of WLAN from cable LAN into wireless world, is aiming 
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at supplying data communication under the wireless environment, but not voice capaci- 
ty. The 3G networks are built on cellular infrastructure, which is most suitable in sup- 
porting data service under mobile environment. Cellular infrastructure backs up signal 
handoff between different cellular networks, thereby provide customers with mobility 
covering the whole networks. 

3G whose logos is to supply the public with carrier-class businesses, set up it’s aim 
of at providing multimedia business at beginning, which contains voice and data, glob- 
al roaming and so on. So the user can enjoy nice networks safety as well as different 
QoS. As know, WLAN has obvious advantages in indoor wireless data business, but 
it’s weaknesses of low coverage, bad mobility and nonsupport of voice are also promi- 
nent; otherwise, 3G can support wireless voice and data business, has a good coverage 
and mobility, but low bandwidth is its big weakness. If the 3G operator combines with 
WLAN supplier, integrates 3G networks with WLAN networks, then the wireless users 
can enjoy high-speed data transmission as well as seamless roaming between different 
networks without any restriction [2, 3]. 


Table 1. Contrast parameters table of WLAN with 3G technology 


Parameter WLAN 3G 
Working 2.4GHZ 3.11GHZ 
frequency working frequency permissive 
frequency 
Broadband 11-108Mbps 2Mbps 
Coverage 50-150m global roaming 
Service high-speed voice and data 
orientation data integration 
Equipment Data/PC center telecom 
operations center 
Main FH and DSSS CDMA 
technology 


As in table | has listed the contrast parameters of WLAN and 3G technology, and 
the following is to show you two modes for integration. 


2.1 The Tightly-Coupled Integration Mode of 3G with WLAN 


As showed in figure 1, the WLAN networks apply the same way with 3G base stations 
to connect with 3G core networks, which takes full advantages of mobility, safety and 
quality service existed in 3G core networks. But to connect with 3G core networks 
directly, the present WLAN production of tightly-coupled integration need to be remo- 
dified and remade, and the equipments support UMTS and CDMA are also different. 
In this mode, WLAN brings 3G main transmission networks business traffic and sig- 
naling, connects to core networks’ AAA server through AC, and be authenticated by 
HLR (Home Location Register). And the data channels of WLAN directly connected 
with PDN through AC. The service charge information of using WLAN are gathered 
by AC, and reported to 3GPP charge system through AAA server. So, making a little 
modification to the standards and architecture of 3GPP networks, can realize uniform 
authentication and charging. In this mode, apply mobile IP technology can also support 
roaming operations between two different systems. 
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Fig. 1. The tightly-coupled integration architecture of 3G with WLAN 


2.2 The Loosely-Coupled Integration Mode of 3G with WLAN 


As showed in figure 2, WLAN connects with mobile networks as a supplementary 
form, only shares the usage of AAA with 3G, so avoid the hybrid of data flows caused 
by two different access technology at the 3G core networks node. This combination 
guarantees the totally separate of two wireless networks, make them complete inde- 
pendence. Sharing AAA infrastructure, to make the 3G operators can use accordant 
user authentication under the 3G and WLAN hybrid networks circumstances, and help 
the operators gathering the using information of WLAN, then form a uniform customer 
bills including the using information of 3G and WLAN. 
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Fig. 2. The loosely-coupled integration architecture of 3G with WLAN 


This integration mode has flowing characteristics: the data of two systems are inde- 
pendent from each other. 3GPP maintains the present standards unchanged, and the 
users enjoy services as before. The WLAN systems do not need to make any changes, 
and the users can get one piece of bill from the same operator. So there’s no need of 
roaming operation between WLAN and 3G, it’s the simplest integration method. In 
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this mode, using mobile IP technology can also support the roaming operation within 
two different systems. The concrete application: realize function FA (Foreign Agent) at 
the connecting node of AC and GGSN (Gateway GPRS Support Node), and set up a 
server as HA (Home Agent) in additional networks. So applying the mobile IP operat- 
ing process, and users can roam between two systems by Wi-Fi mobile converged de- 
vices. But in this mode, WLAN can’t directly visit resources and businesses in 3GPP 
networks, as the present mobile IP is not perfect whose performance can hardly meet 
the carrier-class requirements. 


3 The Integration of WiMAX with 3G 


WiMAX (Worldwide Interoperability for Microwave Access), is a broadband wireless 
access method with mobility, has higher speed and wider coverage. It can supply users 
with a 75MB transmission speed, 50Km coverage at farthest, support 120km/h 
high-speed vehicular movement, and a better hierarchical QoS guarantee in wireless 
multimedia businesses. It is mainly used in wireless communication within MAN 
(Metropolitan Area Networks). 


Table 2. Contrast parameters table of WiMAX with 3G technology 


parameters WiMAX 3G 
Working frequency _2.6GHZ frequency 3.11GHZ frequency 
Acceptance range 10km unrestricted 
Transmission speed _120Mbps 2Mbps 
Mobility 120km/h 250km/h 
Service Broadband low-speed data Voice and data 


Main technology OFMD/MIMO/OFDMA CDMA/TDMA 


As showed in table2, WiMAX mobile communication system positions at packet 
data transmission service, its peak data transmission speed can arrive at 75Mbit/s, 
much higher than 3G system. And it is mostly used for fixed, portable or low-speed 
users access, don’t back up seamless roaming under high-speed movement. But 3G 
mobile communication system has advantages of supporting high-speed roaming as 
well as a whole network covered communication service. If combine the two systems 
of WiMAX with 3G when networking, make WiMAX emphasis on achieving broad- 
band mobile and meet the high-speed service requirements in some hot area, while 3G 
emphasis on achieving high-quality voice communication and seamless roaming needs 
of mobile communication, then it can realize advantages complementary, save invest- 
ment cost, and arrive a win-win situation as well as meet the users needs of smart 
card '4, 5]. 

3G is established in mobile broadband, while WiMAX based on broadband mobile, 
the both have a nice complementarity in technology and service, so it is feasible to 
unite networks of 3G with WiMAX, even a preponderant hybrid network system. Ac- 
cording to the level of their interdependences, the integration architecture of WiMAX 
with 3G can be divided into two modes: tightly-coupled integration architecture and 
loosely-coupled integration architecture. 
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3.1 The Tightly-Coupled Integration Mode of 3G with WLAN 


As showed in figure 3, in the tightly-coupled integration architecture, the WiMAX 
system gateway connects with GGSN of 3G system through tightly-coupled interface, 
the IF shield the property of WiMAX to SGSN, make SGSN take WiMAX system as a 
separate base station. WiMAX uses the authentication, charging of 3G networks, its 
upper runs 3G relative protocol, needs some transformation to the protocol stack and 
adds corresponding interfaces to 3G protocol stacks. In the tightly-coupled integration 
architecture, the access networks of WiMAX use the same way as 3G to combine with 
the core network, and directly connect to SGSN node of 3G networks, both of the 
access network of WiMAX and 3G are equal. The WiMAX gateway conceals the de- 
tails of WiMAX network technology from 3G networks, deploys all protocols (authen- 
tication, mobility management and etc) required by 3G wireless access networks. 

This tightly-coupled integration mode will make a big attack to the core network of 
3G, because the SGSN network components of 3G core network have to been rede- 
signed to support the ascending service whose service load is far from its property. 
Among the tightly-coupled integration architecture, there also brings up some new 
requirements on users’ equipments, the mobile termination of the integration network 
can run the IEEE802.16 and 3G protocol stack at the same time. 
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Fig. 3. The tightly-coupled integration architecture of 3G with WiMAX 


3.2 The Loosely-Coupled Integration Mode of 3G with WiMAX 


As showed in figure4, in the loosely-coupled integration architecture, the WiMAX 
system gateway connects with GGSN of 3G system through loosely-coupled interface 
to achieve interconnection with each other. WiMAX system support the authentication 
based on SIM, then can share users’ data, authentication and charging function with 3G 
system. In the tightly-coupled integration architecture, the data of WiMAX network 
directly connects with public networks out of 3G core network. To 3G core network, 
there is no directly interface with WiMAX, their access networks are parallel. To make 
the users from WiMAX access network can also enjoy the service provided by packet 
switching area of 3G core network, there need to construct a data tunnel among 3G 
core network and WiMAX network, through which can transmit the data from 3G core 
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network to WiMAX access network. WiMAX access network and 3G access network 
can use different authentication, charging, mobile management mechanisms and proto- 
cols. But, to realize seamless handoff, different mechanisms and protocols must coor- 
dinate each other. 

The loosely-coupled integration mode utmost maintains the independence of Wi- 
MAX and 3G network, makes less changes to present networks, is more open, more 
advantages and competitive. But in the loosely-coupled integration architecture, as 
there is no direct tunnel between the two networks, the data and signal are all switched 
through internet, and the data received by mobile node have to pass HLR (Home Loca- 
tion Register). So the transmission is inefficiency and handoff delay is long, not easy to 
provide highly real-time service, such as video streaming transmission. 
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Fig. 4. The loosely-coupled integration architecture of 3G with WiMAX 


4 The Integration of WLAN, WiMAX and 3G Networks 


As showed in figure 5, connecting with the two network architecture of WiMAX and 
WLAN Mesh, realize the wireless broadband access of Mesh gateway node with 
WiMAX the bone back network through the broadband wireless access system. The 
MAP in WLAN Mesh network can connect with the user terminal directly, also be 
used as AP access interface of traditional WLAN and combine multi-wireless local 
area networks together by the way of wireless Mesh. On the one hand, it can achieve 
intercommunication among the traditional wireless local area networks; on the other 
hand, it can also realize the wireless broadband access. So, WLAN can satisfy some 
hot areas with higher speed data transmission, and WiMAX connects different hot 
areas together, then it can achieve a wider coverage of high-speed data access. Other- 
wise, 3G network positions at mobile subscribers’ voice communication and low-speed 
data wireless communication over the whole networks, it arrives an advantage com- 
plementary of three technologies integrates multi access ways, and forms a hierarchical 
broadband wireless access network, and provides more efficient service to different 
needs. 
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Fig. 5. The integration architecture of WLAN, WiMAX and 3G networks 


5 Handoff between 3GPP System and Non-3GPP System 


HMIPv6 (Hierarchical Mobile IPv6), is the enhanced version of IPv6. It was proposed 
for reducing signal traffic and speeding mobile connection. HMIPv6was proposed by 
Internet Engineering Task Force, mainly aimed at providing calculation function for 
Internet, and solving seamless handoff problems within heterogeneous networks. As 
showed in figure 6, HMIPv6 brings a new node- MAP (Mobile Anchor Point) as a 
local entity to manage the mobile handoff. Using mobile IPv6, the mobile node will 
send location update information to its relational nodes when each time it updates the 
location. UE directly access to Internet through common wireless access ways. To 
support the mobility, it needs to be updated in local bindings and home bindings, and 
also the updated the registration to HSS through AR. UE (users’ equipments) access 
the 3GPP target system, including authentication, address allocation and wireless re- 
sources construction. The authentication is from 3GPP system to UE, and UE after 
updating local bindings through target system, the hierarchical data can then pass the 
new transmission link. 
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Fig. 6. Mobility proposal based on HMIPV6 system 
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6 Conclusion 


The integration of all kinds of wireless networks is an inevitable trend with the devel- 
opment of wireless technology, has important realistic meaning and research value. 
This paper discusses on the heat technology-integration of 3G with WLAN, proposes 
some typical solutions, and gives out a handoff method between 3G system and non- 
3G system. But as the integration of heterogeneous wireless networks is complicated 
system engineering, referring to system architecture, protocol stack, seamless handoff, 
seamless management and security, so to realize the real seamless integration between 
wireless networks is still facing many challenges and having much work to be done. 
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Abstract. To solve the problem of shortage of low frequencies along with the 
development of mobile communication, the research on the leaky communica- 
tion system using for the mobile communication in blind and semi-blind zone 
was towards to high frequency and wide frequency area. Combining with the 
3G and 2G mobile communication frequency band, the method of expanding 
the frequency range of single mode radiation leaky coaxial cable was studied, 
and the radiation field of leaky coaxial cable whose frequency range is ex- 
panded was simulated, and the relationship between frequency range expanding 
and radiation field was analyzed, the conclusion set a basis for the application 
of leaky coaxial cable. 


Keywords: leaky coaxial cable, wide-band, mono-radiation, radiation field. 


1 Introduction 


Leaky coaxial cable is also called sequential antenna, according to the electromagnet- 
ic theory, it has some periodic or non-periodic slots on its outer conductor[1]. 
All the slots along the cable are the radiation sources of electromagnetic wave, and 
when the signal is transmitting in the cable, part of the energy is radiated out of the 
cable from the slots. The leaky electromagnetic signal can be received by the receiver 
along the cable; or signal from the mobile transmitter get into the leaky coaxial cable, 
and this communication method can make up the disadvantage of the existing of blind 
zone in the traditional communication, so that the omni-bearing duplex communica- 
tion between leaky coaxial cable and outer space can be realized easily[2][3]. Leaky 
coaxial cable is the combination of wire and wireless communication, at the same 
time is the integration of microwave and electromagnetic field and electric 
wave transmission theory, and it can be widely used in the communication in sub- 
ways, tunnels, caves, mines, high ways, ships and other blind or semi-blind zone. 
Especially along with the coming of 3G, the requirement of wide-band communica- 
tion is getting higher and higher[4], it is very necessary to expand the frequency band 
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of leaky coaxial cable, and the research on the radiation field of wide-band leaky 
coaxial cable is turned to be an important issue today[5][6]. In this paper, the method 
of expanding the frequency band of leaky coaxial cable is analyzed, and the simulated 
results show that in the mono-radiation zone, the wide-band leaky coaxial cable can 
work well in the 3G communication. 


2 Mono-Radiation Theory 


If the propagation coefficient of an uniform infinitely long magnetic current J,, in z 
direction is { , the electric field of the point p(z,,r) that out of this magnetic current 
is [1]: 


E=—jou,V xi, (1) 
Here, @ is angular frequency, [],, is magnetic hertz vector, and 
+oo 


II, =(1/ j4zan, ) } J,(e 2 /R\ dz. k, is wave number in free space, “4, is the 


co 


permeability, and R=,/r°+(z—z,)’ is the distance between view point P and mag- 


netic dipole as shown in Fig.1: 


+ co 
AZ 
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Fig. 1. Magnetic current source 


If the magnetic J,, transmits in z direction, there is only z component of [T,, [1]: 


l too e iBetky r+(z-Z)?) 


dz (2) 


mz m 
P +(z-Z9)° 


JATOL, —0o 
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Let B, = ky’ - Bp ,z- Z =rsinht , then: 


—jBz + 
a Jn e@ iPorcosht J 


= dt =-—"—H, (B rye” (3) 
" j4nOLy ~. day, Pr 
Here H a (x) is zero-order Hankel function of second kind, from (3) and (1), we can 
obtain: 
JB, ip 
Ey(1,2) = JAH Bre (4) 


Where H,” (x) is one-order Hankel function of second kind. 


The magnetic current J, of leaky coaxial cable which period of vertical slots is P 
is also a periodic function with P, and the Fourier series of it is: 


_ .2an, 


ID= Vi Ime ? (5) 


n=—0o 


Here J,,, is the amplitude of J,,, n is integral number, then the field in @ direction of 


mn 


leaky coaxial cable can be written as: 


E,(r,2) 5 Y ImByH? (Bane (6) 


and B=k,Jé,, 8, =f8+2an/p,f, is the propagation constant of electromagnetic 
wave in radial direction, {, is the propagation constant of nth spatial harmonic wave 
in z direction. 


It is known from f, =./k, — 8,’ that if 8, <0, the field falls in radial direction, 
and there is no radiation, so the surface wave exists around leaky coaxial cable; only 
if B, > 0, the electromagnetic wave can be radiated in radial direction, and that is the 
requirement of radiation: 


ky apy >0 (7) 


Now we can obtain the following expression by using k, = 27 f /c and B= ky Jé, : 
—nf, < f <—nf, (8) 
where f, = c/ PUle, +1) fh = e/ Pfe, -1) , ¢ is the velocity of light in free space, 


€,is the relative permittivity of the dielectric material in the cable, and n must be 


integral number only. 

It is obviously that there are infinitely many spatial harmonic components around 
leaky coaxial cable with periodic slots, and they are called modes because they are 
similar to the transmission modes in waveguide[7]. Only if n<—1, the radiation 
exists, and (8) is the frequency range which can product radiation wave[8]. 
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The diagram for radiation frequency band of different harmonics is illustrated in 
Fig. 2: 
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Fig. 2. Spatial harmonic modes 


According to the interference of leaky wireless wave from slots, the frequency 
band can be divided into 3 areas[9]: 


1. The frequency band in which B, is imaginary number, that is in f < f,. In this 


area, there is only surface wave around leaky coaxial cable and the field of them can 
be disturbed very easily, the transmission distance is short, so it is not suitable to be 
used in the mobile communication. 

2. The frequency band in which B, is real number and there is only mono-radiation, 


and that is in f, < f <2/, . In this area, there is only -1th mode radiation wave around, 
and this mode is needed in the mobile communication. 
3. The frequency band in which se is real number and there is multi-modes radia- 


tion, when the frequency increases to 2f,, -2th mode radiation wave appears, and 


along with the increasing of the frequency, -3th, -4th, etc. modes begin to radiate, but 
different modes have their different transmission constants, the result of it is the 
standing wave, which will product strong interference between every modes, and the 
fluctuation of field make the quality of mobile communication fall, so it is necessary 
to suppress the high order spatial harmonic component, that is to make the frequency 
between f, and 2f,, realize mono-radiation, and the frequency band is f, . 


3 Wide-Band Leaky Coaxial Cable 


To expand the frequency band of leaky coaxial cable, all the high order spatial har- 
monic in the -1th radiation band must be suppressed, and the more the modes are 
suppressed, the wider the frequency band is. There are three methods to achieve it: 


1. Add some new slots on the outer conductor, and their size and shape are the 
same as the original ones. The according high order harmonic can be suppressed by 
adjusting the distance between the new and old slots. 
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2. Adjust the inclined angle of slots while keep the arrangement and length of orig- 
inal slots invariant to suppress the high order harmonics. 

3. Adjust the length of the slots while keep the arrangement and inclined angle of 
original slots invariant can also suppress the high order harmonics. 


The first method is often chosen because the latter two methods are hard to achieve. 
The following is leaky coaxial cable with zigzag slots which is usually used in our 
country: 


Fig. 3. Leaky coaxial cable with zigzag slots 


The field around it is[10]: 
(r, Q,z z) => Z, (I= el") R (n,. r, 9) e FB (9) 


n=—00 


Here Z, is periodic function in z direction, R(7,r,@) is related with r , g and radial 


jn 


equals to zero, nth harmonic can be 
suppressed, so there is odd order modes of leaky coaxial cable with zigzag slots, and 
the frequency band is between f, and3f, [11]. 


transmission constant 77. When n is even, l—e 


The following figure is leaky coaxial cable with double zigzag slots in one period: 


MEX 1 \X 


Fig. 4. Leaky coaxial cable with double zigzag slots 


The same as above, the frequency band of this kind of cable is between f, and Sf, , 


and it is four times of original band. So we can know that the expanding multiple of 
frequency band is two times of the pairs of zigzag slots in one period of leaky coaxial 
cable, that is if there are m pairs of zigzag slots in a period, the frequency band can 
get 2m to increase after expanding. 

According to the International Telecommunication Union standardization, the fre- 
quency band for 3G communication is 1885MHz to 2025MHz, and in general case, 
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the slot period of leaky coaxial cable is 0.22m, €, = 1.25, then we can obtain that the 
mono-radiation frequency bands of leaky coaxial cable with periodic slots, zigzag 
slots and double zigzag slots are 643.83MHz—1287.66MHz, 643.83MHz— 
1931.49MHz and 643.83MHz—3219.15MHz respectively. So the last kind of leaky 
coaxial cable is suitable for using in 3G mobile communication in blind or semi-blind 
zones. 


4 Simulation Results 


To check the performance of the leaky coaxial cable after frequency band expanding, 
the simulated radiation fields analyzed by HFSS are as follows. The cable’s parame- 
ters are: the length of it is 300m; the radiuses of inner and outer conductor are 9mm 
and 22.8mm; the incline angle of slot is m/4 ;€, =1.25, and the mono-radiation fre- 


quency band is 643.83MHz—1287.66MHz. 


(a) Frequency is 1885MHz (b) Frequency is 2025MHz 


Fig. 5. Radiation field of leaky coaxial cable with zigzag slots 


These two figures are the radiation fields of leaky coaxial cable with zigzag slots at 
the lowest and highest frequency of 3G. Figure (a) shows that the leaky coaxial cable 
is in the mono-radiation status when the frequency is 1885MHz, and the field of it is 
smooth, so it can work well in the mobile communication. When the frequency in- 
creases to 2025MHz, the leaky coaxial cable gets out of mono-radiation, the field 
showed in figure (b) is not stable, and more side lobes which can disperse the energy 
of leaky coaxial cable or import some unnecessary interference emerge, so if the cable 
works under this frequency, the quality of mobile communication will be affected, 
and the communication distance will be cut short too. 

The following two figures are the radiation field of leaky coaxial cable with double 
zigzag slots in one period at the frequency of 1885MHz and 2025MHz, and its fre- 
quency band has been expanded to 643.83MHz—3219.15MHz. 
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(a) Frequency is 1885MHz (b) Frequency is 2025MHz 


Fig. 6. Radiation field of leaky coaxial cable with double zigzag slots 


We can know from Fig.6 that leaky coaxial cable is in the mono-radiation status no 
matter the frequency is 1885MHz or 2025MHz, the main lobe remains in the original 
position, and the field is stable, so it can be used in 3G mobile communication. 


5 Conclusion 


The simulation results show that change the slot structure of leaky coaxial cable in 
one period can expand the frequency band and application range, and turn the cable 
be useful in 3G communication. The radiation character and main lobe of leaky 
coaxial cable does not change after band expanding, and the side lobe energy decreas- 
es at the same time. So it is effective of adding zigzag slots in one period to expand 
the frequency band of leaky coaxial cable until to the frequency range of 3G mobile 
communication, but if there are enough many slots in one period, because of the close 
distance, the electromagnetic wave from different slots will disturb with each other, 
so the limitation of the adding slots and the radiation field of leaky coaxial cable with 
more zigzag slots will be studied in the future. 
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Abstract. To optimize the design of flow field of nozzle in aluminum 
roll-casting, a coupled fluid-thermal finite element analysis using ANSYS 
software, was performed to explore the distributing of velocity and temperature 
of melt aluminum in spacer-free nozzle by MATLAB. Curve of velocity was 
gently sloping but curve of temperature was steep at outlet of spacer-free nozzle. 
It was explored that geometrical sizes of nozzle should be pre-designed to get 
better curves of velocity and temperature before placing spacers into nozzle. 


Keywords: roll cast; spacer-free nozzle; flow field; coupled fluid-thermal; finite 
element analysis. 


1 Introduction 


Modern twin-roll casting is appealing urgently for discovery and research of optimi- 
zation of processing. Normal roll casting doesn’t request stricter design for flow field 
and thermal field, so spacers inside nozzle are used to hold flow’s passage and separate 
the flow. There are many kinds of spacers used in nozzles, and at the normal speed of 
roll casting, they are not sensitive to the uneven spreading of velocity and temperature 
of flow field of melt aluminum, especially with lower speed and thicker sheet of roll 
casting [1, 2]. 

But for high speed and thin sheet of roll cast, those kinds of nozzles with spacers are 
not good enough. As industrial experiments cost too much, simulation of flow field and 
thermal field of melt aluminum in nozzle is used before on-site experiments. Usually 
comparing with a lot of results of different designs, and combining with experience of 
engineering, then the most reasonable optimized plans are chosen for doing industrial 
experiments. 

From former work, conclusion is drawn that one-spacer nozzle has potential capac- 
ity to be designed for higher speed and thinner sheet of roll cast. Before optimized 
designing one-spacer nozzle, the shape of nozzle without spacer should be pre-design 
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first, which will be discussed in this paper. Section 2 gives the detailed description of 
the problem; Section 3 explains characters of this model of flow field, and describes the 
coordinating and boundary conditions used in the analysis; Section 4 presents and 
analyzes the simulation results; Section 5 is conclusion. 


2 Description 


Spacer-free nozzle’s 

main geometry SIZES Entrance of front-box 
of flow fluid include 
thickness and width 
of entrance and outlet, 
and the slope of side 
edges from entrance 
to outlet. Due to the 
limitation to produc- 
ing process, the width 
of either entrance or Fig. 1. Schematic of flow field of front-box & 

outlet is fixed: if the spacer-free nozzle 

width of sheet is de- 

cided, then the width 

of outlet of nozzle is determined. Melted aluminum flows from entrance of front-box, 
crossing the interface of front-box and nozzle, passes nozzle, and finally outflows at the 
outlet of nozzle, as shown in Fig. 1. Then the fluid of melt metal with certain spreading 
of velocity and temperature will arrive at cast-rolling-area between the gap of two roll- 
ers. This model of front-box and nozzle is prepared for the future simulation model 
of cast rolling, which will include multiphysics fields considering casting rollers, melt 
cave area, and the cast-rolling area altogether. 

Model of flow field of both front-box and nozzle is built to take the liquids in both 
the front-box and the nozzle into consideration all together. As an isolated model of the 
nozzle field will bring inconsistency to the model at the interface of entrance, a merged 
field of both front-box and nozzle will avoid errors caused by separated modeling. 
Geometrical sizes are shown in Table 1. 


Interface 


Y~™ Outlet of nozzle 


Table 1. Geometrical sizes of nozzle 


Name of Width of | Widthof Thickness Thickness Depthof Speed of roll 


size entrance outlet of entrance _ of outlet nozzle __ casting sheet 
Along axis x x Zz Zz y y 
Units m m m m m m/s 


Value 0.780 0.890 0.016 0.007 0.370 0.0167 


Pre-design and Analysis of Flow Field of Spacer-Free Nozzle of Aluminum Roll-Casting 653 


3 Analysis of Modeling 


3.1 Mesh Type 


In module of FLOTRAN of ANSYS software, there are two different meshing me- 
thods: free mesh and mapped mesh, the 
former is strongly recommended to 
module of FLOTRAN, as the latter 
suitable to module of Structure. In the 
analysis of fluid FEA, mapped mesh 
method is helpful to reduce the calcu- 
lating error, avoid divergence, and ob- 
tain more accurate results. Even if the 
models are complex, the mapped mesh 
method should be applied as much as 
possible. Fig. 2 is mapped meshing 
model of flow field of front-box and 
nozzle. 


Fig. 2. Mapped meshing model of flow field 


3.2 Boundary Condition 


3.2.1 Fluid Field Boundary Conditions 
At the entrance of the front-box 
the boundary condition is set to 
the Entrance Condition, v,=0, 
Vy= Vo, V=O0, where vVx,Vy,Vz! 
variable component of fluid 
velocity vector whose units is f U\e 
[m/s]. At the exit of the nozzle it { —_. } \os ss 
is set to the Free Exit Condi- [ 
tions, P=0, where P: Pressure of 
the fluid whose units is [N/m’]. 
At the bottom of the front-box 
and at nozzle where fluids are 
constrained they are set to the 
Fixed Boundary Conditions, v, 
=0, vy =0, v, =0. At the top of 
the front-box the flowing speed 
along the Z-direction is set to 
zero, Vv; =0, vy =0. 


F =-.0091L 
a = 


Fig. 3. Isosurface of contour display of v, 


3.2.2 Thermal Boundary Conditions 

At the entrance of the front-box the boundary condition is set to temperature: f=fo, 
where t: temperature of nodes, its units is K. At the bottom of the front-box the heat flux 
to the boundaries between melted aluminum and front-box is applied: h=h;. At the top 
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of the front-box the heat flux 
to the boundaries between 
melted aluminum and air is 
set: h=h. At internal nozzle y hk 

surfaces where fluids are xy |_| ¢ NE 
constrained corresponding {| \ } ; y } F : 
heat flux to the boundaries  =7~ \ \. “S_ aa r  =876.048 
between melted aluminum | —— OP | [a =s4-08 
and nozzle are set: h=h3, lL \e “Soo” ; y 

where /h represents: heat flux | Sadie 

to the boundaries whose units I~ S a x » 

is [W/m’]. [3] 


4 Result of Simulation Fig. 4. Isosurface of contour display of temperature 


Postprocessing by ANSYS[4]: ANSYS’s default result display of flow fields in 
front-box and nozzle is in vector display, but not as clear as contour display. Plotting of 


analysis result of 
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files of ANSYS, by using MATLAB software, as shown in Fig. 5. Postprocessing of 
results by MATLAB figured out absolute and fractional errors of velocity and tem- 
perature of melt aluminum at outlet of nozzle, which could judge how suitable the flow 
field of nozzle will be for producing with higher casting speed and thinner thickness of 
aluminum sheet. 


5 Conclusion 


Nozzle without spacer is disscussed for optimizing flow fluid of nozzle, which will be 
set spacers to obtain even spreading of velocity (vy) and temperature (f). From analysis 
of spacer-free nozzle, it is concluded that vy is smooth and flat, as ¢ is not at outlet of 
nozzle, which means that spacer-free nozzle is good for vy but bad for t. Temperature at 
outlet shows a convex curve, and maximum of vy and f¢ are at the midpoint of width, so 
spacers at mid part have to be added to reduce the detrimental effect. Further more, it is 
supposed that optimazing the geometrical sizes of nozzle without spacer, is helpful to 
get better shape of nozzle which will serve explanate curve of both vy and ¢ before 
placing spacers in nozzle[5]. 
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Abstract. Optimization of nozzle was crucial process to improve quality of 
production of roll casting, and nozzle without spacer should be pre-designed 
before placing spacer in it. Using orthogonal experiment design, three main 
structure dimensions and the speed of roll casting sheet were chosen as 4 factors, 
to make orthogonal array of 3 levels for simulation of nozzle. It was concluded 
that an optimal combination of factors and levels could be provided by ortho- 
gonal experiment, which might achieve optimum of good distributing of velocity 
and temperature of flow fluid in nozzle. The optimized result would be used as 
the original condition of integrated design of nozzle and spacers. 


Keywords: orthogonal experiment design; spacer-free nozzle; roll cast; coupled 
fluid-thermal; finite element analysis. 


1 Introduction 


Accurate analysis and design of modern twin-roll casting is appealed for producing 
aluminum roll casting sheet with high speed and thin sheet [1, 2]. During many tech- 
nological parameters, the distributing of velocity and temperature of flow fluid at outlet 
of nozzle used in roll casting process, is a key to get high quality sheet. As there are 
some main geometrical parameters of nozzle to consider, such as sizes of nozzle, shape 
and location of spacers and so on, so optimization methods should be introduced to 
improve geometry parameters. 

The orthogonal experiment design is chosen here as optimization algorithm for 
spacer-free nozzles’ geometry. It can offer excellent factor design, and find an optimal 
combination of factor levels that may achieve optimum [3,4]. 
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Section 2 gives schematic of main geometry parameters of spacer-free nozzle which 
will act as factors of orthogonal experiment; Section 3 explains orthogonal arrays used 
in this experimental design; Section 4 presents and analyzes simulation results of or- 
thogonal experiments; Section 5 is conclusion. 


2 Description 


For batch simulation, structure of model 
should be simplified to cut down the time 
needed for modeling and analysis. Struc- 
ture of nozzle without spacer is all built by 
planes, and it is already very simple, so 
only front-box can be changed for easier 
drawing. Without changing shape of 
curves on horizon, bottom of front-box is 
from curved surface to planes, as shown in 
Fig.1. 

It is supposed that sizes of front-box are 
fixed, so are width (0.78 m) of entrance 
and depth (0.37 m) of nozzle. Then three 
other geometrical parameters of nozzle, as 
shown in Fig. 2: width of outlet (A), 
thickness of entrance (B), and thickness of 
outlet (C) can be optimized. Besides, 
speed of roll casting sheet is worth consi- 
dering too [5]. Values range of these four 
parameters is shown in Table 1. 


Fig. 1. Schematic of flow field of front-box 
& spacer-free nozzle 


Width of entrance __. Thickness of entrance B 
\ 


Depth of nozzle 


H Thickness of outlet C 


Width of outlet A 


Fig. 2. Front view and side elevation of spacer-free nozzle 
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Table 1. Range of optimization parameters 


Width of Thickness of Thickness of 


Speed of roll 
outlet entrance outlet i 
a B C casting sheet 
Units m m m m/s 
Upper limit 0.920 0.019 0.007 0.167 
Lower limit 0.860 0.013 0.003 0.0167 


3 Orthogonal Experiment Design 


3.1 Choose Orthogonal Array for Optimization Parameters 


Orthogonal arrays are used in this experimental design in order to explore an optimal 
combination of geometry values of spacer-free nozzle, shown as factor levels that may 
achieve optimum.[] Four factors chosen here are width of outlet, thickness of entrance, 
thickness of outlet, and speed of roll casting sheet. Orthogonal array is shown in 
Table 2. 


Table 2. Orthogonal array of 4 factors and 3 levels of each factor 


Width of Thickness of Thickness of 


Number of Speed of roll 
experiment outlet aa putes casting sheet 
A B C 
Units m m m m/s 

1 0.860 0.013 0.003 0.0167 

2 0.860 0.016 0.005 0.0835 

3 0.860 0.019 0.007 0.167 

4 0.890 0.013 0.005 0.167 

5 0.890 0.016 0.007 0.0167 

6 0.890 0.019 0.003 0.0835 

7 0.920 0.013 0.007 0.0835 

8 0.920 0.016 0.003 0.167 

9 0.920 0.019 0.005 0.0167 


3.2 Finite Element Analysis of Each Experiment 


Proceeding of coupled fluid-thermal analysis with ANSYS software is seen in refer- 
ence [6,7]. 


3.3 Evaluation System 


To judge how suitable the flow field of nozzle will be for producing with higher casting 
speed and thinner thickness of aluminum sheet, stricter analysis of result by software 
MATLAB is utilized to make comprehensive graphics of flow fluid’s velocity and 
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temperature on midline of nozzle’s outlet surface: two degrees of freedom of nodes in 
finite element analysis. Fig. 3 is an example of comprehensive graphics from experi- 


ment No. 4. 


Based on industrial experience, on midline (y=0, z=0) of outlet of nozzle, the best 
distributing of velocity is a flat line which parallel to outlet; and that of temperature is 
also a flat line, but parts of two sides of midline will be a little higher temperature than 
mid part. So the shape and error of each curve of velocity and temperature of flow fluid 


of outlet are decisive of evaluation. 


4 Result 


According to 9 com- 
prehensive graphics 
of results, first of all, 
compare every 3 re- 
sults of the same 
speed. For example, 
at speed of 0.167 m/s, 
there are experiments 
of No. 3, 4, 8, appar- 
ently it is judged by 
above method, that 
No. 8 is better than 
other two, and No. 4 
is better than No. 3. 
Secondly, compare 
results of other same 
factor with same 
level, to get other 
order of superiority. 
But finally it is rea- 
lized as an original 
and rough judgment, 
which cannot supply 
accurate analysis 
between results which 
are of different levels 
and factors. 


5 Conclusion 


Orthogonal experiment design is used to make orthogonal array of 3 levels of 4 factors 
for simulation of nozzle, while three main structure dimensions and the speed of roll 
casting sheet are chosen as 4 factors. An optimal combination of factors and levels 
could be provided to achieve optimum of good distributing of velocity and temperature 


vy [m/s] 


= vy [m/s] 


t[K] 


0.5 


Fractional error of vy 


Fractional error of = vy 


Fractional error of t 


0.5 


0.15 
0.1 
0 0.5 
x 10 
3 
2 
1 
of N 
We 
-1 / 
2 
0 0.5 


Fig. 3. Comprehensive graphic of vy & t at 0.0167 m/s 
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of flow fluid in nozzle. Experiment No. 8 is one of better results, but to provide ap- 
propriate designs for investigating the main effects in order of priority, quantitative 
analysis of simulation result is needed and will be discussed later. Once the optimized 
result is decided, it can be used as the original condition of integrated design of nozzle 
and spacers. 
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Abstract. Conventional analysis of orthogonal experimental results was applied 
to optimize structure of spacer-free nozzle in aluminum roll-casting. Weighting 
factors of velocity and temperature were determined in analysis with extreme 
deviation. It was concluded that good distributing of flow fluid’s velocity and 
temperature, would come with bigger width of outlet, modest thickness of en- 
trance, and smaller thickness of outlet. The main effects in order of priority were 
investigated and advices on improving performance were given for industrial 
experiments. 


Keywords: orthogonal experiment design; extreme deviation; spacer-free noz- 
zle; roll cast; finite element analysis. 


1 Introduction 


To achieve high quality production of twin-roll casting with higher speed and thinner 
sheet [1, 2], the optimization design of flow fluid of nozzle should be attached great 
importance to. Some geometrical parameters of structure are decided by special pro- 
ducing conditions, such as width of aluminum sheet, but other geometrical parameters 
can vary within a certain range, which are suitable to design levels of orthogonal ex- 
periment design. At the beginning of optimization design of nozzle, geometrical sizes 
of space-free nozzle should be considered to optimize at first, then spacers added. The 
orthogonal experiment can find an optimal combination of levels of factors [3, 4] for 
finite element analysis of flow fluid of spacer-free nozzle. As discussed before [5], 
three geometry parameters of structure of nozzle and speed of roll-casting sheet are 
chosen as factors of orthogonal design. 

Section 2 serves preliminary analysis of experimental result; Section 3 presents 
weighting factors for analyzing simulation results of orthogonal experiments; Section 4 
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is result of evaluating by extreme deviation; Section 5 draws conclusion and gives 
suggestion for improvement of orthogonal experiment. 


2 Preliminary Analysis of Experimental Result 


Comparing 3 cases 0.11 , 03 
with each same ee > oe 
LP ss a ux 0.25 | me 
speed, sort these > 91\)~ ‘| is ro *] 
cases by order of € | | E ne l || 
superiority. POP gg | got] | 
example, at speed of 2 os} | 
0.0835m/s, there are + > = Q | | 
0.08 (0.05 
three groups of No. 0 0.5 1 0 0.5 1 
2, 6 and 7. It can be x10 
observed from %4—— = = 4 
Comprehensive = 6 als - - 
graphic of vy & tf 982 va _~ 5 ae 
that Group 6 (as ¥ ane 5 0 Va * 
shown in Fig. 1) is ~ 980} / N = ra \ 
better than other two / \ = ial 
groups, and group 2 978 & A 
(as shown in Fig. 2) 9 0.5 1 9 0.5 1 
is better than group 
7. But if analysis Fig. 1. Comprehensive graphic of vy & t of group No. 6 
with more than one 
factor, it is hard to 
judge which one is ae By 28S ; : 
better, and how _94,, 2 ——~_ % 03,—- -— - —~ 
much it is better ¢ Lo ™~| e oe we \| 
than others. Quan- S01) | S) | 
oe : ee 3g 0.2] | 
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3 z= 3 sS 
ment has to be 0.08 E04 
lied 0 0.5 1 0 0.5 1 
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986 - 4 
eet 7 7 7 S = 2 7 
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— 984 — = - 
Factorfor @& os Baal) ye 
Quantitative ~ 4..| / a s ie a 
° i \ a -2 \ 
Analysis \ e 
ae 0.5 ey 0.5 1 
Measurement of ; ; 
each group’s supe- 
riority is to adjust Fig. 2. Comprehensive graphic of vy & t of group No. 2 


every possible 
weighting factor to 
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fit for the order of visual analysis by observer or analyst. Usually more than one 
weighting factor will be settled. From coupled fluid-thermal finite element analysis, 
there are two degrees of freedom are involved: velocity and temperature of flow 
fluid, which suggests that two weighting factors of velocity and temperature will be 
considered. 


3.1 Weighting Factor of Velocity (W) 


As 3 different kinds of speed of roll-casting sheet used in design, even a same model 
will show different distribution of velocity and temperature, so weighting factor of 
velocity (W) must be designed as a synthesized factor, which means W is the last one to 
be multiplied by. 


3.2 Weighting Factor of Temperature (Ws) 


With the same simulation model, the higher speed of roll-casting sheet increases, the 
more even distribution of temperature is. So weighting factor of temperature (Wr) only 
applies to item of temperature. By lots of comparing and calculating, weighting factors 
are chosen in Table. 1: 


Table 1. Values of weighting factors 


Velocity of roll-casting Weighting Factor of Weighting Factor of 
[m/s] velocity: W temperature: Wr 
0.0167 1.0 1.4 
0.0835 1.1 1.2 
0.167 1.2 1.0 


4 Result of Evaluating 


Conventional analysis [6] by extreme deviation is as below in Table. 2: 
AK: extreme deviation, is equal to maximum minus minimum of one row. 


Values of K/~K3 corresponding to Level 1~3 are calculated as below: 


K/A=25+24.64+19.2=68.84 
KA=244+24.4427.72=76.12 
K;\=22.88+30424.5=77.38 
KP=25+24+22.88=71.88 
KP =24.64+24.44+30=79.04 
K;P=19.2+27.72+24.5=71.42 
K©=25+27.72+30=82.72 
K,°=24.644+24+24.5=73.14 
K;°=19.2+24.44+22.88=66.48 


666 Y. Zhou et al. 
Table 2. Visual analysis of experimental result of simulation of nozzle 


Evaluation system: S=W*(Sv+ Wt*St+Sq) 
Width Thickness Thicknes Speed of WeightinGrade Grade Grade Final 


Ne of outlet of s of roll  gfactor onvy — with on grade 
entrance outlet casting of weighting vy*t 
velocity factor 
A B C D W Sv Wt*St Sq S 
1 860 13 3 0.0167 1.0 9 1.4*5 9 25 
2 860 16 5 0.0835 1.1 7 1.2*7 7 24.64 
3 = 860 19 7 0.167 1.2 4 1.0*8 4 19.2 
4 890 13 5 0.167 1.2 6 1.0*8 6 24 
5 890 16 7 0.0167 1.0 8 1.4*8 8 24.4 
6 890 19 3 0.0835 1.1 9 1.2*6 9 27.72 
7 920 13 7 0.0835 1.1 5 1.2*9 5 22.88 
8 920 16 3 0.167 1.2 8 1.0*9 8 30 
9 920 19 5 0.0167 1.0 9 1.4*5 8.5 24.5 
KI 68.84 = 71.88 82.72 
K2 > 76.12 79.04 73.14 
K3 77.38 ~=—-71.42 66.48 
AK 854 7.62 16.24 


The grade of effect on experiment result calculated by extreme deviation of each 
factor, results that sequence from high effect to lower effect, is in order of factor C, A 
and B. Optimization result in the orthogonal experiment is from group No. 8. 


5 Conclusion 


By the result of evaluation, it is concluded that thickness of outlet (C) is the key factor 
of all, width of outlet (A) is less important factor, and minor factor is thickness of en- 
trance (B). To obtain good distributing of flow fluid’s velocity and temperature, bigger 
width of outlet, modest thickness of entrance, and smaller thickness of outlet will be 
helpful. 

Advices from industrial experiments: 

Width of outlet (A) determined by machine of roll-casting and production, it cannot 
be adjusted casually, so it should be treated as an unchangeable parameter, and instead 
width of entrance can be designed as a factor. 
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In this experiment, speed of roll-casting sheet has not been treated as an independent 
parameter, but as a factor. Now it is explored that next design will be project of 3 
factors with 3 levels, without speed of sheet, it will be more accurate and reasonable. 
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Abstract. To optimize shape of single-spacer nozzle, width of single spacer was 
chosen to be the only optimized parameter to discover its influence on distribu- 
tion of velocity and temperature of nozzle’s flow fluid. According to results of 
coupled fluid-thermal finite element analysis with four different widths of single 
spacer in nozzle, it was discovered that the greater width was, the more uneven 
of distribution of velocity and temperature of flow fluid at outlet of nozzle 
would be. 


Keywords: optimization design; single-spacer nozzle; roll cast; coupled flu- 
id-thermal; finite element analysis. 


1 Introduction 


Single-spacer nozzle used in aluminum roll casting is one of structures which had 
potential to obtain good flow fluid which matches the cooling capacity of casting 
rollers [1]. Theoretically an integrated scheme of optimization of single-spacer nozzle 
should be followed as these steps: |. analysis spacer-free nozzle’s geometrical para- 
meters by orthogonal experiment; 2. determine shape and variables of single spacer, 
and analysis single-spacer nozzle; 3. analysis of combining variable of single spacer 
with geometrical parameters of nozzle [2, 3]. 

The optimization scheme has been designed for industry experiments, limited by the 
factory’s technical conditions, many structure parameters of nozzle have been fixed, 
and it’s why only one variable of single-spacer nozzle discussed in this paper. 
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Section 2 gives optimization scheme of single-spacer nozzle; section 3 displays 
boundary conditions applied to finite element analysis of nozzle; section 4 shows result 
of analysis; section 5 indicates conclusion. 


2 Optimization Scheme of Single-Spacer Nozzle 


Under limited conditions of industrial actual experiments, there are many fixed para- 
meters of nozzle, and a few parameters of spacer can be designed as optimization pa- 
rameters this time. 


2.1 Modeling with Geometrical Parameters of Spacer-Free Nozzle 


Spacer-free nozzle’s main 
geometrical parameters are 
shown in Fig. 1 and Table. 1. 
First, since front-box is ap- 
pointed, then size of nozzle’s 
entrance is followed fixedly 
too. Furthermore, nozzles are 
custom-made, and _ thick- 
nesses of nozzle’s entrance 
and outlet are hard to adjust, 
so they have to be change- 
less. At last, geometrical 
parameter of nozzle is width Fig. 1. Geometrical parameters of flow field of 
of outlet, it is determined as spacer-free nozzle with front-box 

constant. Finally all sizes of 

nozzle are fixed. 


Table 1. Parameters of spacer-free nozzle for 3-d modeling 


Width Width Thickness Thickness Depth Speed of 


of of outlet of entrance of outlet of roll-casting 
Name 
entrance B C D nozzle sheet 
A E 
Units m m m m m m/s 


Value 0.89 1.300 0.017 0.007 0.37 0.0835 
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Fig. 2. Geometrical parameters of single spacer in nozzle 


2.2 Variable of Single-Spacer Nozzle 


Shape of the single spacer is designed as droplike curve created by spline curves which 
use fitting curves to link several key points. Coordinates of key points can be calculated 
by program with position, depth of spacer, and width of spacer at interface of front-box 
and nozzle (as shown in Fig. 2). Depth of spacer in this plan is 0.2 m selected by former 
results of both simulation and real experiments. Width of spacer at interface is the only 
variable in this optimization scheme, and values are listed in Table 2: 


Table 2. Values’ list of width of single spacer 


Group No. 1 2 3 4 
Width of spacer [m] 0.1 0.15 0.2 0.3 


3 Finite Element Analysis of Nozzle 


Mesh type and boundary conditions of coupled fluid-thermal FEA are shown in ref- 
erence [3~5]. 


4 Result 


Simulation of four groups brings about results in Fig. 3. Workplane of z=0 is the most 
convincing place where contour display of vy (y - axis component of velocity) & t 
(temperature) shows most details. [6] 

Comprehensive graphics are drawn by MATLAB in Fig. 4, which come from nodes’ 
two degrees of freedom: vy & f on midline (y=0, z=0). 
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Fig. 3. Isosurface of contour display of vy & t on workplane (z=0) of four groups 
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Fig. 4. Comprehensive graphics of vy & t on midline (y=0, z=0) of four groups 


From Fig. 3 and Fig. 4, it shows that the greater width of single spacer is, the more 
and sharply sunken curves of vy & t around mid part of nozzle’s outlet are. Especially 
when width is equal to 0.3 m, the sunken has rapid swing than others. 


5 Conclusion 


Though in this case, a lot of factors could not be set as optimized parameters, and only 
width of single spacer on interface of nozzle and front-box acts as variable, but it still 
can be discovered that smaller width of spacer will produce smoother distribution of 
velocity and temperature at nozzle’s outlet. Width of spacer will also influence width of 
entrance besides spacer, here comes another explanation of why 0.3-m-width spacer 
has such sudden change of curves of vy & f, it is because area of entrance is much 
smaller than those of other groups, as volume of flow (which is approximately equal to 
area plus velocity) is constant, so flow’s velocity through entrance must be bigger than 
others, and partially rising of velocity increases uneven of distribution of vy & f. 
Analysis of single-spacer nozzle can be reference for further optimization of 3C-spacer 
nozzle. 
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Abstract. Domestic financial enterprise data management applications general- 
ly brings together the vast amounts of data, but can not find the relationship and 
business rules exists in the data to do risk prediction assessment. Therefore, the 
domestic financial companies need to accelerate the pace of information tech- 
nology in regions of integration of customer resources, business analysis and 
investment decisions. This paper analyzes the risk assessment approach of 
banks, mainly focuses on the analysis of association rules data mining in bank 
risk assessment, and discusses the working principle of improved association 
rules algorithm genetic algorithm in commercial bank risk assessment. We de- 
scribed the methods and processes of system application. We select the matrix 
form, only scan the database once, and use the method of selecting assumption 
frequent items and numbers, find the frequent item sets through high end item 
sets, minimize the number of candidate data sets, greatly improve the efficiency 
of the algorithm. 


Keywords: bank; risk assessment; association rules; algorithm. 


1 Introduction 


With the extensive application of database and computer networks, demands on bank- 
ing sector increased continuously. In the face of three key factors, change, competition 
and customer, which impact and determine the development of bank, if there is no in- 
formation technology, it will become increasingly difficult for banks to understand, 
grasp and respond to change and increasingly difficult to integrate resources, planning 
restructuring to cope with competition, also difficult to achieve their business process 
reengineering and strategic tasks of intelligent marketing and management decisions. 
Comprehensive information management is the real core competitiveness of bank. 
Therefore, domestic commercial banks all consider the development of entering infor- 
matization as an important strategic move, established a relatively perfect system of 
financial informatization. Since the end of 2006, China opens the full domestic 
financial markets to WTO. Foreign banks are allowed to engage in RMB totally, the 
competition between domestic and foreign banks become more intensive. In this global 
financial crisis, although China's banking sector was not seriously impacted itself, the 
market turbulence remains intense, and China’s banking sector also needs to seize the 
opportunity, make great efforts to promote the process of informatization of domestic 
banking sector. 
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For credit card, here exist the issues of malicious overdraft and fraud which bring a 
great deal of risk to bank. Therefore, card issuers should take effective measures to 
prevent risks in advance, to quantify the credit rating to applicants' qualifications and 
credit, and then assist decision-makers to decide whether to give the credit card to the 
applicant. Usually bank judge the credit of applications through statistical techniques 
and experience, however, with rapid increase of credit card users and the trading vo- 
lume, Experience alone is not enough to effectively make the right judgments, there- 
fore, we need to introduce intelligent information processing technology to provide 
decision support to decision-makers. In this paper, we focus on the association rules of 
intelligent algorithm, discussed and researched upon risk assessment of banks. 


2 Bank Risk Assessment Methods 


Bank risk refers to the possibility of encountering economic loss during operating by 
various factors, or just the possibility for banks to meet assets and income loss. Ac- 
cording to the cause of risk, the risks include credit risk, market risk, interest rate risk 
and legal risk, credit risk is the current key risk facing banking sector and also the main 
research of this paper. Since the 30's of 20th century, bank credit risk assessment me- 
thod has mainly gone through 3 stages, judge according to experience, statistical analy- 
sis and artificial intelligence. 
Mainly includes the following methods. 


e Expert Judgment, in the initial phase of the credit rating, as the historical data 
information of trading partners is not enough; the level of trading partners’ cre- 
dit is entirely based on subjective experience of credit experts. This method is 
not efficient, costs high, and often have inconsistent conclusions. 

e Scoring method, Banks and credit rating companies based on a pre-designed 
set of standardized indicators system, to rate each indicators of risk status 
about trading partners and customers, then average the rate according to the 
importance, and make the totally score the main judgment to customer risk rat- 
ing. This method requires risk management experts to set indicator and impor- 
tance according to their experience. The scoring of each indicator also need 
experts to use their experience and feelings, therefore, the level and experience 
of experts has a great impact on the effectiveness of ratings. 

e Model approach, the method of risk rating system is based on trading partners 
or customers’ historical database. Built the probability statistical model on his- 
torical data, including the discriminate analysis model, probability of default 
measurement model and loss given default rate measurement model. This me- 
thod has the advantage of high efficiency; low cost, high accuracy measure- 
ment of default risk factors, inadequacies is difficult to directly enter the model 
of qualitative indicators, making it difficult to reflect the qualitative indicators 
of information. 


3 Data Mining 


With the rapid development of database technology and the widely application of 
management systems, people accumulated more and more data. Many important 
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information are hidden behind data, people want to have higher level of analysis to 
make better use the data. 

The current database system can efficiently implement data entry, modification, 
statistics, query and other functions, but can not find the relationships and rules that 
exist among data, thus can not predict the future trends according to the current data. 
As currently lack of means to detect the knowledge behind the data mining, this led to 
the “data explosion but lack of knowledge” phenomenon. Therefore, intelligently and 
automatically valuable knowledge and information research among large amounts 
of data, know as data mining, is of great practical significance and wide application 
prospects. 

From the technical point of view, Data mining is the process of the extraction of 
implicit, and unknown but potentially useful information and knowledge among a lot 
of, incomplete, noisy, fuzzy, random, real data. From the perspective of business appli- 
cations, data mining is a new business information processing technology. Its main 
feature is a commercial database or data warehouse large amounts of data extraction, 
transformation, analysis and pattern processing, to extract the key of knowledge sup- 
porting business decisions, from a database or data warehouse model to automatically 
find related business. 


Data a 


Predict Model we Model 


I 
Sd \ Discover 
Gath 


Classification Return analysis Predict ther Accumulate Associate series 
Fig. 1. Classification of data mining. 


In short, data mining is actually a class of in-depth data analysis method. Data 
analysis itself has many years of history, only that the data collection and analysis in 
the past targeted at scientific research. But data mining is to do exploration and analy- 
sis on a large number of enterprise data in accordance with corporate business objec- 
tives. It’s the effective ways of reveal the hidden, unknown or the verify regularity, and 
further the model. 

The predictive model will predict the value of the data from results known from 
different data. It can accomplish data mining tasks, including classification. Map data 
to the predefined group or class. Return (the data item is mapped to a real predictor 
variable). Time series analysis, the behavior of determines the series according to the 
distance and structure. Forecast data value based on the historical event sequence dia- 
gram. Predict (based on past and current data to predict the future state of the data). 
Descriptive model identify the data in patterns or relationships. Different from the pre- 
dictive models, descriptive models provide the method of data nature analysis, rather 
than predict the new nature. It usually consists of Gather (unsupervised learning or 
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partition), Accumulate (to be accompanied by a brief description of the data mapped to 
the subset), association rules (reveal the relationship between the data) and the series 
discovery (identifying data related to time between the sequence modes). 


4 Association Rules in Bank Risk Assessment 


At present, the association rule mining technology has been widely used in western 
financial industries and enterprises; it can successfully forecast demand for bank cus- 
tomers. Once got this information, banks can improve their marketing. The current 
banks are developing new methods of exploring new way of communication with cus- 
tomer every day. All banks bundled product information that may be interesting to 
customer in their ATM machines. To benefit customer who use their ATM and what to 
understand the products. If the database shows that a high credit customer change the 
address, the client is likely to buy a new bigger house, so there may need a higher cre- 
dit limit, the higher end of the new credit card, or need a housing improvement loans, 
these products can be mailed to the customer through the credit card bill. When cus- 
tomers call to consult, the database can effectively assist telephone sales representative. 
Sales representative's computer screen can show the characteristics of clients, and what 
products the customer would be interested in. 


4.1 Association Rules 


Agrawal first proposed in 1993, equivalent to mining a database of customer transac- 
tions association rules between sets of items, and designed a basic algorithm, its core is 
based on the frequency of the recursive method of set theory, which is based on the 
frequency set of two-stage method of thinking , the design of association rules is de- 
composed into two sub-problems: ( found that the frequency set. This sub-issue is 
most important, the most expensive, therefore, focused on various algorithms to im- 
prove the efficiency of frequent item set discovery. @) According to the obtained fre- 
quent item sets to generate strong association rules. The algorithm uses the following 
two basic properties. The nature of a subset of any frequency band must be set. The 
nature of any non-frequent item sets 2 superset of a non-frequent item sets. 

Apriori Algorithm is one of the classic data mining algorithms of association rules. 
One important step of Apriori is pruning, which matches every subset of the candidates 
with the frequent sets of previous layer, and then those infrequent sets will be removed 
according to one character of Apriori. This operation becomes the fact of time-cost of 
Apriori. A new algorithm named NPA (No Pruning Apriori) referred in this article is 
based on Apriori and, by means of modifying the JOIN operation; the pruning opera- 
tion has been canceled. Such improvement enhances the speed of the algorithm and it 
brings practice application value in a certain degree. Apriori specific algorithm is as 
follows: 


(1) L1 = {large 1-itemsets}; 

(2) for (k = 2; Lk-1 4 k + +) do begin 
(3) Ck = apriori_gen (Lk-1); 

(4) for all transactions t © D do begin 
(5) Ct = subset (Ck, t); 
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(6) for all candidates c © Ct do 

(7) C.count + +; 

(8) End; 

(9) Lk = {c © Ck| c.count > minsup} 
(10) End; 

(11) Answer = UkLk; 


The basic idea of this algorithm is: first find all frequent sets, such as frequent item sets 
occur at least a predefined minimum support the same. Generated by the frequency of 
collection and strong association rules, these rules must satisfy minimum support and 
minimum confidence. Then use Step 1 to find the desired frequency set of rules gener- 
ated, resulting in only a collection of items containing all the rules, in which the right 
side of each rule is only one, here is the rules used in the definition. Once these rules 
are generated, then only those greater than the minimum confidence given by the user 
was only to stay the rules. In order to generate all frequency sets, using the recursive 
method. 

For many applications, the dispersion of the data distribution, it is difficult in the 
most detail level data find strong association rules. Although the rules drawn on a 
higher level may be general information, however, it is common for a user's informa- 
tion, but not necessarily so for another user. Therefore, data mining should provide a 
dig at multiple levels of functionality. Multi-level association rules mining are general- 
ly two ways: one is the single-level association rule mining algorithms directly 
applied to multi-level; the other is applied at different levels of different support thre- 
shold and confidence threshold. Existing multi-level association rule mining algorithm 
is mainly Improved Association Rules Algorithm. 


4.2 Improved Association Rules in Bank Risk Assessment 


Set each item I as an example, each transaction is a line in database D, together con- 
structed incidence matrix M, M is equal to m * n, n is the total number of I, m for the 
database D contains the total number of Project Services. In the association matrix that 
contains the items that each firm, also contains a similar transaction. 

Algorithm ideas, for the sum of each row of the matrix associated, the number of 
items calculated Affairs. Calculated to support the number of items removed is less 
than the number of sets to support the remaining items to our collection to find. 

Input, transactional date T, the minimum support minsup count digital. Out put, the 
maximum frequency item set L. 


1. C [n] = 0; //C [n] n the maximum number of items 
2. For each ti is Tdo { 

3.1=I|til 

4.C fi] =C [fi] +1 

5.} 

6. Fori=nto 1 { 

7. If (c [i] is greater than minsup) then { 

8.K=i 

9. Break 
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e 10.} 

e 11.} 

e 12. Fori=kto 1 do { 

e 13. Ck= {select k - item sets} 

e 14. For each Ci is Ck do { 

e 15.Lk= {Cicount Ci is greater than minsup} 
e 16.} 

e 17. If Lk is not equal to return Lk 

e 18.} 


From the point of example calculation, the use of matrix, only scan the database once, 
reducing the operation, and break the conventional approach using the first hypothesis, 
set out to find items from the high set, to minimize the number of data sets to improve 
work efficiency. 


4.3 Improved Association Rules in Bank Risk Assessment 


Select a month credit card records and customer information as credit card data, raw 
data, set the item sets C1, C2 ... Ck represent clients in T1, T2 ... Tk business consumer 
behavior. 

First, scan personal credit card data, to obtain a matrix set, select the previous data 
sets. 

Second, calculate the number of frequent item sets, the maximum was 7. Based on 
business experience, book the minimum support is 50. 

According to Apriori algorithm, calculate the various supports. Assume that the 
most frequent item sets of 6, to obtain {C1, C2, C5, C8, C12, C13} support for the 67, 
{C1, C2, C5, C9, C12, C14} support for the 23, {C1, C2, C4, C10, C12, C13} support 
82, which said that {C1, C2, C5, C8, C12, C13} and {C1, C2, C4, C10, C12, C13} for 
the maximum frequent item sets. Compared to the classic Apriori algorithm, the im- 
proved method only scans the database once, and improves work efficiency. Based on 
this data, you can engage in business promotion with the proposed business alliances 
{T1, T2, T5, T8, T12, T13} and {T1, T2, T4, T10, T12, T13}, and expand consumer 
groups, the maximum boost consumer spending, meaning a positive win-win business. 


5 Conclusion 


Domestic financial enterprise data management applications generally brings together 
the vast amounts of data, but can not find the relationship and business rules exists in 
the data to do risk prediction assessment. Therefore, the domestic financial companies 
need to accelerate the pace of information technology in regions of integration of cus- 
tomer resources, business analysis and investment decisions. This paper analyzes the 
risk assessment approach of banks, mainly focuses on the analysis of association rules 
data mining in bank risk assessment, and discusses the working principle of improved 
association rules algorithm genetic algorithm in commercial bank risk assessment. 

We described the methods and processes of system application. We select the ma- 
trix form, only scan the database once, and use the method of selecting assumption 
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frequent items and numbers, find the frequent item sets through high end item sets, 
minimize the number of candidate data sets, greatly improve the efficiency of the 
algorithm. 
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Abstract. A wide bandwidth microstrip array antenna is proposed numerically 
and experimentally. By cutting an inverted slot in the patch and using the 
stacked structures, the impedance bandwidth of the presented antenna is broa- 
dened. In addition, the distance of elements of the array is properly settled and 
the coupling between the elements has been reduced effectively and the dimen- 
sion of the array antenna is reduced by about 50%. Experimental and numerical 
result shows that the proposed array antenna, with compact size, has an imped- 
ance bandwidth range from 5.01GHz to 6.09GHz for voltage standing-wave ra- 
tio less than 2, which can meet the requirement of WLAN, WiMAX and C-band 
communication applications. 


Keywords: microstrip array antenna; wideband antenna; L-shaped slot. 


1 Introduction 


With the increasing applications of the wideband wireless technologies, the WLAN, 
WiMAX and SAR systems have been in the spotlight worldwide. Various antennas 
with wide bandwidth have been proposed recent years. For the available designs, the 
printed monopole antennas reported in [1-5] have narrow bandwidth and lower gains. 
However, most of them were addressed to the needs of WLAN applications, and per- 
sonal communication systems [5], very limited compact antenna designs have in- 
cluded the 5-6GHz for WLAN, WiMAX and SAR applications. Recently, a lot of 
wideband antennas have been investigated in [6-9]. The proposed technologies in- 
creased the impedance bandwidth of the antennas. However, the dimensions of the 
antennas are large and bandwidth is still narrow which limited its applications. 

For these reasons, the bevel technology and parasitic elements method [10-11] are 
given to improve the bandwidth of the antennas. However, these antennas have larger 
size and lower gain. Therefore, the proposed antennas can’t meet the demand of the 
wideband applications. In this paper, a wideband array antenna is investigated numer- 
ically and experimentally. In order to enhance the bandwidth of the array antenna, the 
stacked antenna is employed and an inverted L-shaped. The L-shaped slots are cut in 
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the different stacked patches. The distance of elements of the array is properly settled 
and the electromagnetic coupling between the elements is also reduced. What is more, 
the dimension of the array antenna is reduced by about 50%. Experimental and nu- 
merical result shows that the proposed array antenna, with compact size, has an im- 
pedance bandwidth range from 5.01GHz to 6.09GHz for voltage standing-wave ratio 
less than 2, which can meet the requirement of WLAN, WiMAX and SAR communi- 
cation applications. The gains, the simulated and measured standing-wave ratio, radia- 
tion patterns are also given and discussed herein. 


2 Antenna Design 


Fig.1 illustrates the geometry and the configuration of the proposed array element o 
the proposed wide band antenna for WLAN, WiMAX and SAR applications. Three 
layers are used to construct and support the investigated wide band antenna. The an- 
tenna element consists of three substrates, two patches and two foams. The two reson- 
ance frequencies are so near that the bandwidth is enhanced. The sub1 used in the 
paper has a dielectric constant € = 2.2 and the thickness is 1.5mm. The sub2 and the 
sub3 used in the paper have a dielectric constant € = 2.9 and the thickness is 1.5mm. 
The foam between the patch! and patch2 is Rohacell 71HF which has a dielectric 
constant € = 1.07 with thickness is 5.3mm. In order to obtain a directional radiation 
pattern, a reflector is employed in the array element. The patch! and patch2 are sim- 
ple rectangular patches with L-shaped slot and inverted L-shaped slot. The ground 
hidden in the middle of the sub2 and sub3 can improve the radiation patterns effec- 
tively. A probe is implemented to feed the array element. The probe via the sub3 and 
sub2, connect to the patch2, and the patch! is excited by the coupling of the patch1. 
The details parameters of the array element are listed in the table 1. 


Feed point 


Feed line 


reflector 


Fig. 1. Configuration of array element 
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Table 1. Structure size of array element 


WwW L d h hy hy 
Patchl 11 18.6 2 12.5 3.3 3.6 
Patch2 8.5 16 0.9 6.6 3.4 1 


The feed line and probe transmitted the energy to the main radiation structure. The 
patch2 is excited and the patch is also coupled. With the matching line, the return 
loss of the array antenna is improved very well. In this design, the patch2 coupled the 
energy to the patchl. By using the L-slot, the current path is changed and the path is 
also prolonged, which leaded to the increased bandwidth. The resonant frequency can 
be easy controlled by adjusting the dimension of the patch2 and patch! and the 
L-shaped slot cut in the patches. In the design, the patch! is the parasitic patch, the 
radiation patch (patch2) excites the parasitic element. Therefore, two resonance fre- 
quencies produced. The two resonance frequencies are so near that a wide band an- 
tenna is formed. 


3 Results and Discussions 


Based on the above discussions and analysis, an array antenna is designed, manufac- 
tured and tested. In order to meet the SAR applications, a 2 x 2 array is analyzed by 
using High Frequency Structure Simulator(HFSS), and the fabricated array is meas- 
ured by using Anristu 37347D vector network analyzer and the radiation patterns are 
obtained in the Chamber. The photo-type of the array is shown in Fig. 2. 


Fig. 2. Geometry of the array antenna 


In the Fig.2, array spacing is about 0.7549. The simulated and measured VSWR 
(VSWR<z2) is shown in Fig.3. It can be seen from the Fig.3, the simulated and meas- 
ured results meet very well. The simulated impedance bandwidth is range from 
4.9GHz to 5.8GHz. And the measured bandwidth covers 5.01GHz to 6.09GHz. The 
differences between the simulated and measured values may be due to the errors of 
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the manufactured antenna and the SMA connector to CPW-fed transition, which is 
included in the measurements but not taken into account in the calculated results. 


46 48 50 52 54 56 58 6.0 


— simulated 
measured 


VSWR 


“4.55 4.95 5.35 5.75 6.15 
f/GHz 


Fig. 3. Measured return loss of the array 
—— 5.35GHz 


-onas 4.85GHz 
30... 5. 85GHz 


(b)H-plane 


Fig. 4. Radiation patterns of the array antenna 
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(b) H-plane 


Fig. 5. Measured pattern at 5.35GHz 


The radiation patterns of the array antenna at 4.85GHz, 5.35GHz and 5.85GHz are 
measured and shown in Fig.4. And the cross polarized is also tested and shown in 
Fig.5. The radiation pattern is in the broadside direction. And the cross polarized level 
is above 12dB. The E-plane and the H-plane at varying frequencies have the similar 
radiation patterns which meet the requirement of SAR applications. The radiation 
patterns in the other frequencies have the consistent radiation characteristics, which 
are not given herein. The maximum gain of the array is 14.76dBi. 


4 Conclusion 


A novel probe-fed stacked array antenna with L-shaped slot is realized in the paper. 
The array antenna is constructed by using multi-substrate and two patches. The pro- 
posed array shows wide bandwidth and consistent radiation patterns in the operation 
band. The simulated and measured results show that the array has higher gain and 
smaller size which is suitable for SAR applications. 
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Abstract. Aiming at the information loss, serious noisy and low resolution of 
sonar image, a sonar image fusion denoising method based on multiple morpho- 
logical wavelet packets is proposed. Firstly, gave a morphological midpoint 
wavelet under the perfect reconstruction condition; Secondly, defined morpho- 
logical wavelet packet to construct morphological Haar wavelet packet, mor- 
phological median wavelet packet and morphological midpoint wavelet packet 
for the noisy image decomposition and threshold processing; Finally, did fusion 
processing of low-frequency and high-frequency components separately accord- 
ing to certain fusion rules and got the final output image through the wavelet 
packet inverse transform. The simulation experiment result shows that the pro- 
posed method is more adapted to sonar image denoising than the single mor- 
phological wavelet denoising method. 


Keywords: Sonar image denoising; Image fusion; Morphological wavelet; 
Wavelet packet. 


1 Introduction 


For single acoustics imaging instrument is usually unable to obtain satisfactory image, 
and the underwater environment is extremely complex, most of the obtained sonar 
images have serious noise pollution and low resolution, which undoubtedly brings 
great difficulties to the late detection, identification, tracking, etc. As an important 
branch of inter-discipline which relates to Data Fusion and Image processing, image 
Fusion aims at getting a more accurate, comprehensive and reliable image description 
through fusion rules by doing the information extraction, automatic analysis and op- 
timal synthesis for multiple image which is the same object obtained in different sit- 
uations (different observation time, different viewing angles, different sensors, etc). 
Image fusion has many outstanding advantages: on the one hand, it improves image 
resolution to increase the information credibility, and on the other hand, it reduces the 
requirements on a single image quality[1]. For fused image describes the integrated 
features of the object, it contains more abundant amount of information than the orig- 
inal image, therefore, from another point, image fusion can be consider as image 
enhancing. In addition, image fusion is not merely a synthesis of original images, but 
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a optimal synthesis that makes the synthesis image contain maximum useful informa- 
tion and discard useless information, so it is also do image denoising simultaneously 
[2]. This paper considered the application of image fusion technology in the sonar 
image denoising field, and gave a sonar image fusion denoising method based on 
multiple morphological wavelet packets. The experimental simulation result validates 
the feasibility and effectiveness of the proposed method. 


2 Image Fusion Based on Wavelet Packet 


Image fusion has three kinds: pixel-level image fusion, feature-level image fusion and 
decision-level image fusion [3]. The popular pixel-level image fusion methods are 
weighted average method, tower transform method, wavelet transform method and 
recent super wavelet transform (ridgelet transform[4], curvelet transform[5], contour- 
let transform[6], directionlet transform[7], etc). Image fusion based on wavelet packet 
transform is shown in Fig.1, and the steps are: firstly, do wavelet packet decomposi- 
tion for two registration images separately; secondly, make data fusion of different 
frequency sub-bands on each decomposition level using different fusion rules; finally, 
take wavelet packet inverter transform, and the reconstruction image namely is the 
fusion result image. 


Wavelet Packet Transform 


Image 1 


/ ___ \Reconstruction 
\ Fusion } > 


Output nage 


Wavelet Packet Transform 


Image 2 


Fig. 1. Image fusion based on Wavelet packet transform (three levels). 


3 Sonar Image Fusion Denoising Method Based on Multiple 
Morphological Wavelet Packets 


3.1 Morphological Wavelet 


Both the nonlinear property of morphological filter and the multi-resolution characte- 
ristic of wavelet transformation are taken account of in the morphological v wavelet 
algorithm, so it has higher research value than linear wavelet. It is proposed by Heij- 
mans and Goutsias, and they also gave the perfect reconstruction condition when 
constructing morphological wavelet [8]. The perfect reconstruction condition for the 
uncoupled Morphological Wavelet is as follows: 
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ysis and synthesis operators respectively; Ww, is the detailed space, o' W, Wi 
and @” Wis 

A number of new morphological wavelet can be constructed in this framework, 
such as the existed morphological Haar wavelet (MHW) in [8] and morphological 
median wavelet (MMedW) in [9]. In this paper, we will construct a morphological 
midpoint wavelet (MMW). 

Firstly, make the midpoint filter as the low-pass filter of morphological wavelet, 
namely the signal analysis operator: 


— W, is the detailed analysis and synthesis operators respectively. 
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where the signal “A” and “v 
respectively. 
Secondly, define the detailed analysis operators as the difference of the two pixels 


in horizontal, vertical, diagonal directions as follows: 


means the minimum and maximum operation 


oo (x)(n) = (@, (x)(n),@, (2)(n), @, (2)()), (5) 
y,(n) = @, (x)(n) = x(2n) — x(2n") (6) 
y, (NM) = @,(x)(n) = x(2n) — x(2n, ) (7) 
y(n) = @, (x)(n) = x(2n)— x(2n}). (8) 


Thirdly, according to the perfect reconstruction condition (1)-(3), synthesis operators 
can be obtained as follows: 
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@ (y\(2n,) = D= y, (n) (12) 


@ (y\(2n*)=D-y,(n). (13) 


Then morphological midpoint wavelet based on the perfect reconstruction condition is 
constructed. 


3.2 Morphological Wavelet Packet 


In 1992, Coifman, Meyer and Wickerhauser gave the concept of wavelet packet [10], 
while in this paper, we give the concept of morphological wavelet packet. Morpho- 
logical wavelet transform contains signal analysis operator and detail analysis opera- 
tor, which respectively corresponding to low-pass filter and high-pass filter. The 
image is decomposed into 4 sub-bands by each level of morphological wavelet trans- 
form: a low-frequency sub-band (signal component) and three different directions of 
the high frequency sub-band (detail component). The low-frequency sub-band will 
continue to do multiresolution decomposition at the next level, but the high-frequency 
sub-bands are no longer decomposed. Similar to wavelet packet transform, in mor- 
phological wavelet packet transform, both the low-frequency sub-band and the high 
frequency sub-band of each layer after decomposition will do the multiresolution 
decomposition at the next level. Morphological wavelet decomposition and morpho- 
logical wavelet packet decomposition is shown in Fig.2, by morphological wavelet 
packet, image can be decomposed into 4 sub-bands at the first level, 16 sub-bands at 
the second level, 64 sub-bands at the third level, and so forth, #4” sub-bands at the m 
level. 


(a) Original image (b) Morphological wavelet (c) Morphological Wavelet packet 
decomposition decomposition 


Fig. 2. Morphological wavelet and morphological wavelet packet decomposition (three levels). 


Image after morphological wavelet fusion, the low-frequency information can be 
well represented, while the high frequency information is lost a lot, which is not satis- 
factory fusion result for the image contains large amount detail information [11]. So, in 
this paper, we consider the more extensive and accurate image fusion method based on 
morphological wavelet packet. Morphological wavelet packet has the advantages 
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of both morphological wavelet and wavelet packet, such as nonlinearity of morpho- 
logical filters, multi-resolution of wavelet and comprehensiveness of wavelet packet. 
According to the existing morphological Haar wavelet, morphological median wave- 
let and morphological midpoint wavelet proposed in the paper, morphological Haar 
wavelet packet, morphological median wavelet packet and morphological midpoint 
wavelet packet can be constructed respectively. 


3.3. Sonar Image Fusion Denoising System 


Make the above three wavelet packets for sonar image fusion denoising system, the 
flow chart is shown in Fig.3, and the specific steps are as follows: 


(1) Do morphological Haar wavelet packet, morphological median wavelet packet 
and morphological midpoint wavelet packet decomposition respectively for the noisy 
image to obtain multi-resolution high-frequency and low-frequency sub-bands; 

(2) Do threshold processing for morphological wavelet packet coefficients of high- 
frequency sub-bands as image denoising method based on wavelet transform; 

(3) According to appropriate fusion rules, do data fusion for morphological wavelet 
packet coefficients of high-frequency sub-bands and low-frequency sub-bands respec- 
tively, then the new morphological wavelet packet coefficients of high frequency sub- 
band and low frequency sub-band can be obtained; 

(4) Do morphological wavelet packet reconstruction, and right now the output im- 
age is the result image using fusion denoising method based on multiply morphologi- 
cal wavelet packets. 


Among them, the fusion rule has a great impact on image fusion result. Because low- 
frequency component after image decomposition represents the image approximate 
part, this paper chooses mean algorithm as the fusion rule, make the average value of 
low-frequency coefficients as the fusion coefficient; for high-frequency component, it 
reflects the image detail part, so we selects the maximum (MS) fusion rules, which 
choosing the modulus maxima of corresponding pixels as the fusion coefficients. 


Morphological | laar 
Wavelet Packei Transform 
\ 
A 
\ 
\ 
24 
\ 
x 
\ 
Morphological Median & BQ 
noisy Wavelet Packet Iransform { seal Reconstruction Output image 
image \ ) 
- 
/ 
7 
Morphological Midpoint / 
Wavelet Packei Transform / 


Fig. 3. Image fusion denoising method using three different morphological wavelet packets 
(three levels). 


694 H. Shi et al. 


The low-pass filter of morphological Haar wavelet packet is the maximum or mini- 
mum filter, according to the characteristics that maximum and minimum filters can 
remove the "pepper" noise and "salt" noise, morphological Haar wavelet packet can be 
inferred that the denoising performance for the "pepper" noise and " salt "noise is more 
pronounced; by analogy, morphological median wavelet packet, using median filter as 
low-pass filter, is more effective for the single-stage or bipolar impulse noise remove; 
morphological midpoint wavelet packet, constructed by midpoint filter in this paper, 
has better performance on Gaussian and uniform random noise denoising system. The 
fusing denoising method based on the three morphological wavelet packets above 
combines the characteristic of them, so it has all their advantages and is more applica- 
ble to image denoising of the image with complex noise. For the noise sources of sonar 
image is extremely complex, such as instrument itself noise, underwater environment 
and so on, denoising use a single morphological wavelet packet may cannot get a satis- 
factory sonar image. So, in this paper, we combine three morphological wavelet pack- 
ets together to obtain a more comprehensive and detailed denoising result. 


4 Simulation Experiment 


In the simulation experiment, we selected a 256x256 sonar image as original image 
(OD), and captained noisy image (NI) by adding Gaussian noise whose noise standard 
deviation is 0.1. Then we did the compare simulation experiment of the fusion denois- 
ing method in this paper, the single morphological Haar wavelet (MHW) denoising 
method in literature [8], the single morphological median wavelet (MMedW) denois- 
ing method in literature [9] and the single morphological midpoint wavelet (MMW) 
denoising method in our paper to verify the feasibility and effectiveness of the pro- 
posed method. For morphological wavelet packet decomposition, although more de- 
composition levels can bring more extensive details to the fusion image, but with the 
number of decomposition level increases, the number of sub-band will increase expo- 
nentially at the same time, which results in large computation, so we set the decompo- 
sition level to 3 in the experiment. The simulation result is shown in Fig.4. 

Noise ratio (SNR), peak signal to noise ratio (PSNR) and mean square error 
(MSE), as the current performance indicators, were taken in the following to con- 
cretely evaluate the various denoising methods above. For image denoising, the high- 
er SNR and PSNR, the lower MSE, means the better effect of the system. Table 1 
shows the statistical properties of different methods in our experiment. 

It can be seen in Fig.4 and Table 1, comparing with single morphological wavelet 
denoising method, our method has better denoising effect. Table 1 shows that the 
indicators of fusing denoising method are all improved, this is because that the fusion 
denoising method makes a more comprehensive denoising by combine the characte- 
ristics of the various morphological wavelet. In addition, it also can be seen from 
Fig.4 that the proposed method not only have better performance in denoising, but 
also reserve the detail of the image well, the reason is that taking morphological 
wavelet packet instead of morphological wavelet can makes the decomposition more 
detailed and comprehensive, so the fusion image retains the edge and other detail 
information. To sum up, the proposed method is more suitable for sonar image 
denoising. 
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Fig. 4. The simulation results of sonar images denoising using different methods. 


Table 1. The statistical properties of sonar images denoising using different method(dB). 


NI MHW MMeW MMW Fusion 
SNR 5.6036 T.AT78 8.1896 9.7436 9.9864 
PSNR 19.9734 21.5476 22.5594 24.1134 24.8316 
MSE 0.0101 0.0070 0.0055 0.0039 0.0030 


5 Conclusion 


In this paper, we combined nonlinearity of morphological filters, multi-resolution of 
wavelet and comprehensiveness of wavelet packet together to give the definition of 
morphological wavelet packet, and constructed a sonar image fusion denoising system 
based on multiple morphological wavelet packets. The simulation experimental result 
shows that the proposed method has better performance than the single morphological 
wavelet denoising method, and the edge preserving effect is also superior, so it is 
more adapted to the denoising of sonar image with serious noise pollution. 
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Abstract. Aimed at the deficiency that non-equilibrium M-Z interferometric 
demodulation method only applies to dynamic demodulation systems and ac- 
cording to the characteristic that there is a linear relationship or near-linear rela- 
tionship between wavelength shift and demodulation parameters (light intensity 
or transmission rate, etc) caused by the change of measured field physical quan- 
tity, this paper designs a kind of fiber Bragg grating linear demodulation sensor 
which demodulates Bragg wavelength by linear demodulation method and it 
realizes the measurement of quasi-static parameters. Apply this design scheme 
to the design of fiber grating temperature sensing system and experiments show 
that the system finishes research requirements according to schedule. 


Keywords: fiber Bragg grating; fiber grating sensor; M-Z interferometer; wa- 
velength demodulation. 


1 Introduction 


Fiber grating sensor is a kind of optical fiber sensor and the sensing process based on 
fiber grating is to get sensing information by external physical parameters modulating 
fiber Bragg wavelength and therefore, it is a kind of wavelength modulation fiber 
sensor. Because there is a natural compatibility between fiber grating and fiber, quasi- 
distributed sensing can be realized easily and the sensing signals of fiber grating itself 
are wavelength modulation, measuring signals are unaffected by light source fluctua- 
tion, fiber bending loss, light source power fluctuation and system loss, the use of 
fiber grating in the sensing field has caused wide attention and great interest among 
related scholars all over the world [1]. 

When the temperature or stress on a fiber grating sensor changes, Bragg reflection 
wavelength shifts and the quantity to be measured can be judged according to the 
shift, and therefore people put forward many demodulation methods, such as filtering 
method, interferometry and adjustable light source scanning method, etc. Compared 
with other demodulation techniques, interferometer demodulation method has 
extremely high detection sensitivity and many foreign researchers have studied the 
method [2, 3]. Combined with the working principles and characteristics of 
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non-equilibrium M-Z fiber interferometer demodulation method, this paper demodu- 
lates Bragg wavelength by linear demodulation method, designs a kind of fiber 
grating linear demodulation device, establishes a fiber grating temperature sensor 
experimental system and makes an experimental result analysis of the system. 


2 Signal Demodulation Technique of Fiber Grating Sensor 


2.1 Requirements for Accuracy 


The theoretical analysis and research of fiber Bragg grating show that the temperature 
and strain sensitivity of fiber Bragg grating are very small. When the wavelength of 
grating is 1500nm, typical temperature and strain sensitivity are 0.011nm/°C and 
0.012nm/ue respectively. To reach the measurement accuracies of 1°C and 10ue, the 
measurement accuracy of shifting center wavelength by A\/ should be superior to the 
magnitude of 0.01nm. And therefore, the detection accuracy of A\/ directly limits 
the detection accuracy of the whole system. The detection technique of AA becomes 
one of key techniques in fiber grating sensing. 


2.2 Interferometer Demodulation Method 


At present, there are three kinds of different interference measurement structures 
adopted in fiber grating sensors, namely Michelson structure, Sagnac structure and 
Mach-Zehnde structure, among which Mach-Zehnde M-Z interferometer can real- 
ize wide-bandwidth and high-resolution demodulation ability and its cost is low, so 
this paper carries out scheme design by M-Z demodulation method. 

M-Z interferometer demodulation technique mainly has three different implemen- 
tation methods, namely non-equilibrium M-Z interferometric demodulation method, 
external modulation M-Z interferometric demodulation method and interferometric 
demodulation method based on 3x3 coupler [4]. 

Non-equilibrium M-Z fiber interferometer demodulation method has advantages of 
wide bandwidth and high resolution, etc, but the influence of random phase shift de- 
cides that this scheme only applies to dynamic demodulation systems [5], unsuitable 
for quasi-static detection. For this reason, this paper puts forward an improved 
scheme which can be used for quasi-static detection and the improved scheme is to 
add a reference grating to the non-equilibrium M-Z interferometer, which is equiva- 
lent to adding a modulating frequency whose size is w to the received signal. The 
reflection signals of sensing grating and reference grating are processed by a phase 
meter after passing a band-pass filter, which can eliminate the interference of random 
phase difference and make it apply to the linear measurement of quasi-static strain. 


2.3. Analysis of Linear Demodulation Principle Based on M-Z Interferometer 


The fiber Bragg grating sensor designed in this paper demodulates Bragg wavelength 
by linear demodulation method which is a kind of wavelength shift demodulation 
technique put forward for solving field practicality. The starting point of adopting this 
scheme is based on that there is a linear relationship or near-linear relationship be- 
tween wavelength shift and demodulation parameters (light intensity or transmission 
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rate, etc) caused by the change of measured field physical quantity [6] and the scheme 
is suited to the measurement of quasi-static parameters. M-Z interferometer is com- 
posed of two 3dB fused-taper couplers, shown as Fig 1. 


fy 


PY Py 
Ps 


fy 


Fig. 1. Schematic Diagram of M-Z Interferometer 


When satisfying the interference condition L, — L, <4 4//A, M-Z interferometer has 
a comb filtering characteristic, at the rising or falling edge of output spectrum, the pow- 
er ratio of two output arms changes very quickly with wavelength change, a filter with 
steep edges can be got by adjusting the length difference of the two arms of M-Z inter- 
ferometer, high-accuracy wavelength shift detection can be carried out by using the 
characteristic, it is the linear demodulation principle based on M-Z interferometer[7]. 


3 Design of Sensing Demodulation Experimental System 


3.1 Composition of Demodulation System 


The demodulation system designed in this scheme is mainly composed of the follow- 
ing parts, namely light source, fiber coupler, fiber Bragg grating, spectrometer, M-Z 
interferometer, photodetector, conversion circuit, operation processing circuit and 
A/D converter. 


3.2 Composition of System Hardware 
Optica path 


Selection of light source. The characteristics of light source decide if a fiber system 
can reach expected indexes. In this topic, the light-emitting device which is used as 
light source should meet the following conditions: 


(1) Small volume, the light-emitting area should be matched with the size of fiber 
core diameter and there should be a high coupling efficiency between light source and 
fiber. 

(2) Emission wavelengths should be suitable for two low-loss wave bands of fiber, 
namely short wavelengths should be between 0.8 and 0.9um and long wavelengths 
should be between 1.2 and 1.6um. 

(3) It can carry out light intensity modulation directly and its connection with the 
modulator should be very convenient. 

The light-emitting devices which are often used for fiber sensing are semiconduc- 
tor laser (LD) and light emitting diode (LED). This scheme adopts a light emitting 
diode as the light source of the system [8]. 
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Selection of coupler. The fiber coupler adopted in this topic is a kind of all-fiber di- 
rectional coupler and its main characteristics are as follows: 


(1) Its body is optical fiber, excluding other optical elements. 

(2) It realizes light coupling function by the coupling effect of transmission mode 
in fiber. 

(3) The direction of optical signal transmission is fixed. 


Bragg grating. Sensing grating plays an important role in fiber grating sensing and we 
mainly consider the following aspects: 


(1) Working waveband of sensing grating. 

(2) Reflection bandwidth, reflectivity and length of sensing grating. 

(3) Side mode suppression. 

(4) Wavelength interval and buffer area of sensing grating (for sensor network). 


The Bragg grating used in this system has a center wavelength of 1543.357nm and a 
reflection bandwidth of 0.3nm, its reflectivity is greater than 90%, its length is 10nm 
and its side mode suppression ratio should be higher than 15dB. 


Spectrometer. The spectrometer used in this experiment is AQ6317C spectrum ana- 
lyzer which is used to monitor the reflection wavelength of fiber Bragg grating and 
the shift value of reflection wavelength with temperature change. In the experiment, 
wavelength resolution Res = 0.5 nm and spectrometer scan number AVG = 10. 


Interferometer 


Making of interferometer. All-fiber M-Z interferometer is made by welding two 3dB 
couplers on two fibers continuously and the length difference between its two arms is 
about 0.5mm. 


Noise of interferometer. The noise of interferometer is the main noise source this 
system considers and it directly influences measurement results. 


P, — P, 2nd 
= = COS = COS 
P,P, 2 ( ) (1) 


A 


It can be known from formula (1) that A¢ =—sin g- Ag, the phase change of interfe- 


rometer will lead to a significant change of system output. When the arm length dif- 
ference d and fiber core refractive index n of interferometer change with the change of 
external environment, the phase output of interferometer can be expressed as: 

T 2 


2 2 1 
Ag= 7 nAd + 7 dAn zz dnAa (2) 


In formula (2), Ad is the change of arm length difference caused by environmental 
factors; /\n is the change of refractive index of two arms of interferometer caused by 
environmental factors. It can be seen from (2) that Ad, An and A\/ can all modulate 
phase to give rise to interference noise. In this way, noise can be divided into 
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stress-strain effect noise, thermal expansion and cold contraction effect noise and 
thermodynamic intrinsic phase noise. Environmental mechanical vibration and sound 
field are main factors causing stress-strain effect. The modulation of phase by temper- 
ature change mainly considers the effects of Ad and An. 


Circuit realization. The circuit is the most important part as well as the core of the 
demodulation system. The circuit system design integrates multiple functions as a 
whole, such as photoelectric detector, regulated power supply, SCM and serial port, 
etc, and the circuit can magnify signals about 1,000 times. 

The light outputted from two arms of the interferometer enters a switch after pass- 
ing the photoelectric detector, the switch can control the reception of signals of two 
arms or one arm and then control the magnification of signals by an amplification 
circuit controller, at last, the light enters A/D converter, these control circuits are all 
controlled by a SCM. In addition, an upper serial port must be designed in the circuit 
to transmit signals to the computer for processing. 


4 Experimental Result Analysis 


4.1 Temperature Sensing Experiment 


Before carrying out a characteristic experiment of the sensing system, we should 
debug every part in light path and circuit first. The center wavelength of the light 
source used in this experiment is about 1551nm and its bandwidth is about 40nm. At 
24°C, the reflection wavelength of the used fiber grating is 1543.322nm, its reflection 
bandwidth is about 0.3nm and it has good side mode suppression effects. 

In the experiment, put a fiber grating device in water and reflect the wavelength 
variation of fiber grating by changing water temperature. The wavelength shift of 
fiber grating is measured by AQ6317B spectrometer OSA; spectrum scan number 
AVG = 10; the 3dB coupler not only couples the light emitted by the wideband light 
source into fiber grating but also couples the light reflected by the fiber grating into 
OSA for detection; in the experiment, the matching liquid should be added to elimi- 
nate the effects of another path of reflected light. The relation curve between the wa- 
velength variation of fiber grating encapsulating device and temperature can be got, 
shown as Fig 2. The change of water temperature is between 24°C and 91°C. 

It can be seen from Fig 2 that as temperature sensing, fiber Bragg grating has a 
good linearity. However, its temperature sensitivity is very low and encapsulating the 
fiber grating with an aluminum cap can increase its temperature sensitivity by three 
orders of magnitude. The fiber grating has a great shortcoming: it is very fragile. It 
will fracture unless you give your whole attention to it. 


4.2 Sensing Experimental Result Analysis 


There are the following problems and deficiencies in the experimental process and the 
analysis of experimental results. 


(1) Inaccurate water temperature control. 
(2) The detection accuracy of Bragg wavelength is limited. 
(3) Effects of stress on temperature sensing system. 
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Fig. 2. Schematic Change of Fiber Grating Wavelength with Temperature 


4.3 Temperature Demodulation Experiment and Error Analysis 


The sensing demodulation experiment is the crux of this scheme. The splitting ratio of 
the two 2x2 couplers constituting the interferometer often can not reach 50: 50 strictly, 
the maximum values of optical power got at two output arms are unequal, which will 
bring the experiment an error. It’s OK if the consistency of maximum values of electric 
signals got at the two ends of circuit input is ensured, so we can realize it by adjusting 
the multiples of photoelectric conversion circuit and amplification circuit. 

In the experiment, we seal one M-Z interferometer in a closed plastic box first and 
then examine if the M-Z interferometer is under interference conditions: in the expe- 
riment, the interferometer is in the linear region when the wavelength shift of fiber 
grating is between 1544nm and 1546nm; and then drive the light source to operate, 
change the temperature of external environment by changing the temperature on the 
sensing grating and measure the output of operation processing circuit, if the tempera- 
ture detection range of temperature sensor is within 100°C, the temperature sensitivity 
of fiber grating is 0.01nm/°C and the variation range of wavelength is about Inm. 
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Fig. 3. Test Results of Temperature Sensor 
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Fig 3 is the ratio of output power difference to output power sum of two arms of 
the interferometer drawn according to measurement data, namely the relation curve 
between the cosine of phase difference and temperature variation. It can be seen from 
Fig 4 that there is a good linearity between temperature and P,-P; / P2t+P3 , the 
resolvable P,-P; / P+ P3 is 0.0025, then the wavelength detecting device can 
detect 0.018°C, if the temperature sensitivity of fiber grating is 10pm/°C, the mini- 
mum wavelength that the system can detect is 0.18pm, reaching the magnitude of pm. 


5 Conclusion 


On the basis of understanding the current development status of fiber grating demodu- 
lation techniques at home and abroad, this paper carries out the research on fiber grat- 
ing demodulation techniques. It analyzes the linear sensing demodulation principles 
based on M-Z interferometer, designs a new kind of fiber grating linear demodulation 
device and establishes a fiber grating temperature sensor experimental system. The 
experimental system finishes research requirements according to schedule, but there 
are still some shortcomings which need to be further studied and improved in the 
future. 
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Abstract. This paper discusses the secrecy authentication goals of non- 
repudiation protocols and verifies the deficiencies of A (0) protocol in authen- 
tication secrecy by an attack method; and then, it modifies the confirmation 
mode of A (0) protocol after its message format and session key are estab- 
lished and puts forward NA (0) protocol; and finally, it makes a formal analy- 
sis of NA (0) protocol by SVO logic and verifies the authentication meeting 
subject identity of NA (0) protocol and the secrecy of its session key. 


Keywords: non-repudiation protocol; authentication secrecy; formal analysis; 
SVO logic. 


1 Introduction 


Non-repudiation protocols are designed to prevent dishonest people denying that they 
have ever participated in a certain affair and refusing to undertake the corresponding 
responsibility. Non-repudiation protocols have two goals, one is to confirm non- 
repudiation of the sender and the other is to confirm non-repudiation of the receiver. 
A good and secure non-repudiation protocol is the necessary condition of completing 
e-commerce transactions [1, 2]. This paper mainly discusses the authentication secre- 
cy of non-repudiation protocols and puts forward the design and formal analysis me- 
thods of authentication secrecy. 


2 Authentication Secrecy 


In a computer network and distribution system, one subject usually needs to confirm 
the identity of the other subject when carrying out resource access or communication. 
Sometimes keys or other kinds of secrets need to be distributed among subjects, the 
authentication protocol is used to describe how to confirm identities and distribute 
secrets among subjects and it is usually composed of a series of exchange messages 
among subjects. The authentication may concern two subjects or more; it may be a 
one-way authentication or a mutual authentication and it may use a symmetric key 
system or an asymmetric key system. The same with authentication protocols, non- 
repudiation protocols also need to confirm identities and distribute keys among 
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subjects and their requirements for authentication secrecy are the same, so the authen- 
tication secrecy of non-repudiation protocols can be discussed by design and formal 
analysis of authentication protocols. 


3 Design of Authentication Secrecy 


The authentication protocol is the basis of network security and even an authentica- 
tion protocol which is established on a perfect cryptographic system may still have 
various kinds of security vulnerabilities. The difficulties in design and analysis of 
authentication protocol consist in the subtlety of security target itself, the complexity 
of protocol operation environment, the complexity of attacker model and the high 
concurrency of authentication protocol itself. This paper will discuss authentication 
secrecy by the design and formal analysis [3] of A 0 protocol and its improved 
protocol next. 


3.1 A 0 protocol 


A 0 protocol is a kind of key agreement protocol that Matsumoto, Takashima and 
Imai got by modifying the key exchange protocol of Diffie-Hellman. It has advantag- 
es of lightening the burden on the authentication center and limiting its authority. As 
the premise of the protocol, a public big prime number P and the primitive element a 
on finite field GF P must be selected first. Before the protocol starts, the two com- 


munication parties A and B select a random integer x and y respectively and send 


R, = a* (mod P) for A and R, = a (mod P) for B_ got by calculation to the 


authentication center T to obtain their respective public agreement key certificates. 
The certificate is the result that the authentication center T signs the identity of any 


subject C and its public agreement key R. . And then, A and B select a random integer 


x and y respectively, A calculates R, = a* (mod P) and B calculates R, =a” (mod P). 


The obtained R, and R, are called the temporary public agreement key of A and B 
respectively. On this basis, A 0 protocol can be executed. The concrete 
A 0 protocol is as follows: 


(I)ASB: A, Ry. (Raft Ro 
(2) BA: B, Ry. (B.Ry fers Rs 


Here, 14, Ra le is the public agreement key certificate of subject A issued by the 
authentication center T. After B receives message (1), it confirms the identity of A by 
verifying the signature of T and then calculates K,,, =(R,)° (R,)° =a” -a® as 
the shared session key with A, similarly, A can also verify the identity of B and obtain 
the shared session key with B: K,, =(R,)* (Ry) =a" -a*”. At last, A and B es- 
tablish the session key between them K,, by executing A 0 protocol. 
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3.2. Improved protocol of A 0  protocol-NA 0 


Many kinds of methods for attacking A 0 protocol have been found so far. The 
most common attack method is as follows: the attacker P carries out normal commu- 
nication and initiates the first round of protocol execution ahead of A [4]: 


(1) PA: P, Rp. {PLRp ft» Rp 
(2) A>P: A, R, , A Rafe R, 


And then, P retransmits the message that A sends to him to B to initiate the second 
round of protocol execution and intercepts the message that B sends to A. 


(1)P A BA, R,. Raft + Ra 
(2)BoP A :B,R,. (BR, fer ¥Rs 


It can be seen from the above attack method that the attackers mainly use the identity 
of A 0 protocol message format and the characteristic that the subjects of the proto- 
col can not differentiate the initiating party of the protocol from the response party of 
the protocol to initiate effective attacks. To ensure the security of the protocol, modify 
the message format of the protocol first to make the protocol differentiate the initiat- 
ing party from the response party and then carry out handshake confirmation after the 
session key is established to make both parties of the protocol confirm that the other 
party has had an agreement session key. For the defects of A 0 protocol, we im- 
prove it and get NA 0 protocol as follows: 


(1) A>B: N,, A, R,  14,R, bts Ra 
(2) B>A: B, R, , 1B, fi! Rn {Na BNo te, 
(3)A>B: {Na ANs hy, 


Here, N,= H Date Time A_ is the random number P generated by A for marking the 
operation of the protocol and N, links the mutual information in the protocol together. 
Date Time A marks the time subject A spends in initiating the protocol. VN, = H Date 
Time B is the random number generated by B for marking the freshness of agree- 
ment key of the protocol and Date Time B marks the time subject B spends in accept- 
ing the protocol. H is a strong one-way collisionless function. Different Date Time 
have different H Date Time random numbers. The protocol uses Date Time to have 
the ability of resisting and refusing service attacks. If attackers want to let the subjects 
of the protocol waste a lot of time on useless wait and unable to provide services for 
honest subjects, the subjects can cancel the execution of long wait protocols according 
to their own Date Time to make attacks fail. 


3.3. Analysisof NA 0 protocol 


The execution premise of NA 0 protocol is very simple, it only needs a certificate 
issuing center. After the participation parties of the protocol obtain their respective 
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certificates, they can execute NA 0 protocol with the person whom they want to 
communicate with to obtain a temporary key shared by both parties and then they can 
use the temporary key to communicate. NA 0 protocol has made good adjustments 
to message format to make every step of message contain the step information of the 
protocol and avoid type flaw attacks. NA 0 protocol is simple and efficient, the 
establishment of its key only needs three steps in all and it doesn’t have redundant 
information. NA 0 protocol has the ability of resisting and refusing service attacks 
by including communication initiation and acceptance time in protocol information. 
In addition, it can be known by formal analysis of NA 0 protocol that it has the 
ability of resisting replay and forge attacks. 


4 Formal Analysis of Authentication Secrecy 


The design and analysis of authentication secrecy are a very difficult task. Even if we 
only discuss the most basic authentication protocol and it only has two or three partic- 
ipation subjects and three or five exchange messages, to design a correct authentica- 
tion protocol which meets authentication goals and has no redundancy is also very 
difficult. And therefore, a kind of proper formal analysis tool is urgently needed to 
make a rigorous formal analysis of authentication secrecy in the protocol and examine 
if the protocol reaches authentication secrecy and there are security flaws and redun- 
dancies in the protocol. 

The most direct and simplest security protocol analysis method is modal logic me- 
thod based on knowledge and belief inference [5]. They are composed of some propo- 
sitions and inference rules, propositions represent the knowledge or beliefs of the 
subject for messages and new knowledge and beliefs can be deduced from known 
knowledge and beliefs by using inference rules. In this kind of methods, the most 
famous method is logic of BAN, including BAN logic, GNY logic, AT logic, VO 
logic and SVO logic. SVO logic absorbs the advantages of BAN logic, GNY logic, 
AT logic and VO logic and integrates them in one logic system. In the aspect of for- 
mal semantics, SVO logic redefines some concepts as distinguished from AT logic, 
thereby canceling some restrictions in AT logic system. 


4.1 SVO Logic 


The marks that SVO logic uses are similar to BAN logic and there are 12 special 
symbols in all. The formal analysis of a security protocol by SVO logic can be di- 
vided into three steps: the initialization assumption set 2 of the protocol is given first 
and then the goal set I" that the protocol may or should reach is given; and finally, 
prove if the conclusion Q|-I is tenable in SVO logic, if it is tenable, it shows that 
the protocol reaches the expected design goal and the design of the protocol is 
successful. 

Using SVO logic can analyze not only all kinds of authentication protocols but also 
the secrecy of non-repudiation protocols which find an increasingly extensive applica- 
tion in the electronic commerce successfully. SVO logic obeys two inference rules 
and ten axioms [6]. The two inference rules are as follows: 
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MP rule: yw can be deduced from g andgdy; 
Nec rule: - P| = q@ can be deduced from - Q. 


Here, g and ware formulas, P represents a subject and + g represents that Mis a 
formula which can be deduced from an axiom. 

The ten axioms are trust axiom, message source axiom, key agreement axiom, ac- 
ceptance axiom, message possession axiom, message comprehension axiom, jurisdic- 
tion axiom, message freshness axiom, temporary value verification axiom and “good” 
shared key symmetry axiom. Only those axioms which are related to authentication 
are listed here. 


4.2 Formal Analysis of NA 0 protocol 


To verify the security of the modified protocol, now use SVO logic to make a formal 
analysis. Analyze the initiating party A and receiving party B of the protocol as 
follows: 


For subject A. The initialization assumption set about subject A[7]: 
P;: A=PK,(,K,), 


= A3(R,,R,.%.x), 
= SV({B.R; fy. K;.(B.R,)) 


P>: 


P3: 
Py: Al = EV(N,,B,N,),KapstNaBNote.,,)> 

Po: Al =#(R,). 

P;: AS((T| PKs BR, AAS BR, (BR tea % A 


Al 
Al 
Al 
Ps: Al=PK5(A,(R,.R,))> 
A 
A 


EV N, BON, Ka (Na B Note, >PKs Bo Ry", 
Ps Aa BR, \B Re fe R, (Na B Nytk, > 
Py: AJZA< BR, {B Ry be RNB Noe, «3 
Pwo: Al= T| BR, >T| PKs BR, , 
Py: Al= Bl) ON, B N, > Bl)= BAB 
Ba Ky, ABl=t Ky, 


P,...P; reflect the initial beliefs of subject A, Pg receives messages, P) comprehends 
messages and Pio and P,; interpret messages. In addition, there is a supplement to 
SVO logic, namely introducing EV X, K, Y to express that the result of encrypting 
X by an encryption key K is Y. 
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Protocol goals[8]: 
Gi. A=<“*#*>B 
Gy: Al=#(K.,) 


Use rules and axioms to infer. 
The results got by all steps are written out first and then the rules, axioms, formulas 
and initialization assumptions used for deducing the results are given. 


(1) A| =(A< 1B.R, f- ) can be got by Pg, acceptance axiom and Nec rule. 


(2) Al=(T| B R, can be got by formula (1), P;, P3; and message source 
axiom. 

(3) A| = T| PKs; BR, canbe got by formula (2), trust axiom and MP rule. 

(4) Al =PK; B (R,,**» )) can be got by formula (3), Po, Py, trust axiom, P; and 
MP rule. 

(5) A| = Ac ““_» Bcan be got by formula (4), Ps, key agreement axiom, trust 
axiom and MP rule. K,, is shown as formula (6). 

(6) Ka, = Fo(R,.R,.Ry.*,) =(R,)* -(R,)* = (R,)” (R,)” = 8" (mod P) 


(7) A| =#(K_,,) can be got by formula (6), P. and message freshness axiom. The 
goal G, is achieved. 

(8) Al =A> (R,>*,) can be got by Po, message possession axiom, trust axiom and 
MP rule. 

(9) Al =A>K.,,, can be got by formula (8), P2, formula (6) and message posses- 
sion axiom. 


(10) A| = A< “42-5 B can be got by formula (5), formula (9), the definition of 


Ac “at 5 B , trust axiom and MP rule. 
(11) AJ=(A {N, ,B,N, te , ) can be got by Ps, acceptance axiom and Nec rule. 


(12) Aj= Bl N, B N, canbe got by formula (5), formula (1) and message 
source axiom. 

(13) Aj= Bl B>K,, can be got by formula (12), Pi, trust axiom and MP 
rule. 

(14) Al= B = Bo>dK,, can be got by formula (7), message freshness axiom, 


formula (13), temporary value verification axiom, trust axiom and MP rule. 
(15) A| = A«<w* _, B can be got by formula (10), formula (13) and the definition 


of Ac #* > B . The goal G, is achieved 


It can be known from formula (7) and formula (15) that protocol goals G; and G2 have 
been achieved. 
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For subject B. The combination of the first message and the second message that 
subject B receives corresponds to the message that subject A receives. It can be got by 
symmetry that the protocol can achieve the following goals[9]: 


G3, B= B¢Se#+>A 
G,: Bl =#(K,,) 


It can be known from I and II that the modified protocol meets security goals. 

The conclusion shows that after the modified A 0 protocol is executed success- 
fully, both subject A and subject B believe that K,, is the session key of freshness 
owned jointly by them and unknown to the others. So it is said that the improved 
A 0 protocol completes a definite key authentication, thereby reaching ideal au- 
thentication goals. 


5 Conclusion 


This paper discusses the design and formal analysis methods of authentication secrecy 
in non-repudiation protocols, puts forward an improved NA 0 protocol and verifies 
its authentication and secrecy. It is to be noted that the existing formal analysis me- 
thods are still far from perfect, it’s because they can only find the shortcomings of 
protocols but can not ensure trouble-free protocols after analysis necessarily secure 
and non-attacking. And therefore, the existing formal analysis methods remain to be 
further deepened and perfected. 
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Abstract. Based on the characteristics of nonlinearity, large delay, strong 
coupling, etc, the fuzzy control method was selected as the control strategy for 
the system. According to the analysis and research about fuzzy control theory 
and greenhouse environment, temperature-humidity deviation and deviation 
variation rate of the system were selected as I/O variables and would be fuzzi- 
fied, and a fuzzy controller was designed for the greenhouse environment moni- 
toring system. After that, a dynamic greenhouse environment model was 
constructed and simulations were carried out. The analytic results got a good 
agreement with the experiment data, which proved the accuracy of the model 
and the feasibility of the control strategy. In the end, the outcome of fuzzy con- 
trol was compared with that of PID control. It was confirmed that the fuzzy 
control with a smaller overhoot and shorter adjusting time is superior to the PID 
control. 


Keywords: greenhouse environment; fuzzy control; dynamic greenhouse mod- 
el; PID control. 


1 Introduction 


The greenhouse, a place that can create the best conditions for plant growth and avoid 
the effects of external seasonal variations and bad weather, is an important component 
of modern agriculture[1]. With the economic development, technological advances 
and importance attached to energy saving, the traditional greenhouse production tech- 
nology has been unable to meet the needs of agricultural development, which is main- 
ly reflected in the management and control strategy of greenhouse environment. This 
paper mainly studied the control strategy. The fuzzy control method was used as the 
system control strategy, the fuzzy controller for greenhouse environment monitoring 
system was designed, and the dynamic greenhouse model[2] was obtained. The re- 
sults of simulation experiment on the system carried out by Matlab/Simulink verified 
the accuracy of the model and the feasibility of control method. Meanwhile, the com- 
parison with PID control was made. 
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2 Design of System Controller 


The greenhouse system is a large system, which has the characteristics of strong mu- 
tual coupling and time-varying between nonlinearity, time-varying, delays, uncertain- 
ty, multiple objectives, and control parameters. Therefore, an accurate mathematical 
model is difficult to be established for the greenhouse environment monitoring sys- 
tem, and traditional control method adopted can not effectively control the green- 
house environment. 

The fuzzy control does not require an accurate mathematical model established the 
controlled object, has better response time, system stability and robustness, and is 
ideal for greenhouse control system. Therefore, this paper focuses on researching the 
control function of fuzzy control method on the environment state. 


2.1 Simplification of Fuzzy Controller 


The fuzzy control theory is a nonlinear control [3] based on fuzzy sets theory and 
fuzzy linguistic variables and fuzzy reasoning. Currently, the fuzzy control theory has 
become widely used in single-input single-output (SISO) system, but for complex 
control system, the multiple-input multiple-output (MIMO) variable system with 
strong coupling is often encountered. For the fuzzy controller, the control rule in- 
creases exponentially with the increase of input, but too many control rules will make 
fuzzy controller become too complex and difficult to control. In the case of multiple 
variables, the structure of fuzzy controller shall be simplified to reduce fuzzy control 
rules. For the MIMO fuzzy controller, its rule has the following form: 


1 2 
R=\ PC cv Pe (1) 


Where: Rien if x is A;, and ... and y is B;, then z, is Cj, ..., Z, iS Cig. 
The antecedent (input and preconditions) of Risnyo is a fuzzy set in the direct 


product space X x ... x Y, the seccedent (conclusion) is the combination of q control 
actions which are mutually independent. Therefore, i-rule can be expressed as the 
following fuzzy implication formula, i.e.. 


Roe: ( A;x...xB;) —( Cit...+Cig) (2) 


2.2 Design of Fuzzy Controller 


The fuzzy controller is the core of the fuzzy control system. The design of fuzzy con- 
troller usually includes the following items: (1) determine the input and output va- 
riables of fuzzy controller (i.e., determine the control volume); (2) develop fuzzy 
control rules; (3) fuzzy quantifying is made on the control variables; (4) selection of 
the universe of discourse and determination of quantization factor, scale factor and 
other parameters. 

The main factors affecting the greenhouse environment are temperature, humidity, 
light intensity, CO concentration, etc. The temperature and humidity have the most 
obvious effects on the greenhouse, so the system made specific research on them. The 
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temperature and humidity control system in the greenhouse environment is a multiple- 
input multiple-output (MIMO) fuzzy control system, and four inputs and seven out- 
puts need to be considered[4]. According to the principle that the MIMO system can 
be simplified into several multi-input single-output (MISO) systems, a multiple of 
MISO fuzzy controllers can be used to replace the MIMO fuzzy controller, to reduce 
fuzzy control rules and simplify the design of the controller. 

The control of greenhouse temperature and humidity has relative independence, so 
the system uses temperature and humidity fuzzy controllers to solve the issue of too 
large control rules caused by multiple inputs. The following analyzes the process of 
building the fuzzy controller of the greenhouse system based on the example of tem- 
perature control. 


Fuzzification of input and output. In temperature control, the system selects the 
temperature deviation e (f) and temperature variation rate ec (f) as input variables, and 
the corresponding fuzzy sets are FE and ECr. The temperature deviation within setting 
value + 1°C is fuzzy control area, i.e., the basic domain of discourse is [-1, 1]. The 
value beyond the domain is treated as the boundary value, the quantification domain 
is [-6, 6], so the quantization factor Ket = 6; the temperature variation rate EC; = 
dE7/dt reflects the variation trend of temperature deviation in on-site state. The basic 
domain of discourse is [-1, 1], the quantification domain is [- 6, 6], so the quantifica- 
tion factor is Kect = 6. 

The fuzzy sets E-and EC;of both the temperature deviation and temperature varia- 
tion rate are expressed by seven fuzzy states, namely PB (positive big), PM (positive 
middle), PS (positive small), ZO (zero), NS (negative small), NM (negative small), 
and NB (negative big). The shape of antecedent membership function of fuzzy rules 
has small impacts on control performance, but the size of breadth has larger impacts 
on performance; the breadth of seccedent membership function has small impacts on 
control performance, so the system used triangular membership function to reduce the 
calculated amount of the system. 

The temperature output control includes heating and cooling, the output fuzzy sets 
fuzzy Ur is expressed by 6 fuzzy states, namely PB, PS, ZO (moderate), NS (light 
cooling), NM (moderate cooling) and NB (heavy cooling). The heating is expressed 
by two fuzzy states. PB means the rapid heating of stove, and PS means slow heating. 
The cooling of greenhouse system is composed by three states: substantial cooling, 
moderate cooling and slight cooling, and the output membership function uses trian- 
gle distribution. 


Establishment of control rule. The basic idea of establishing fuzzy control rules is 
that when the error is large or relatively large, it focuses on selecting controlled quan- 
tity to eliminate the error as soon as possible; while when the error is small, the selec- 
tion of controlled quantity shall avoid overshoot, with the stability of the system as 
the main premise. According to the above principles, combined with the correspond- 
ing control rules, the error can be eliminated by judging the deviation E; and devia- 
tion rate EC; during the temperature adjustment. The winter control rules can be 
described by fuzzy condition statement as follows. 
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IF E=NB AND EC=PB THEN U=PB; 

IF E=NB AND EC=PM THEN U=PB; 

IF E=NB AND EC=PS THEN U=PB; 

IF E=NB AND EC=ZO THEN U=PB; 

IF E=NB AND EC=NS_ THEN U=PB; 

IF E=NB AND EC=NM_ THEN U=PB; 

IF E=NB AND EC=NB_ THEN U=PB; 
49 winter fuzzy control rules can be obtained respectively by analogy. Similarly, the 
summer fuzzy control rules can also be obtained. 


Clarification of fuzzy variables. There are many clarification methods for the fuzzy 
variables, and the most commonly used methods are the maximum membership de- 
gree method, median judgment law and the weighted average method[5]. The system 
uses the weighted average method for the clarification of variables. In general, the 
decision of weight coefficient is related to the system response. Therefore, the appro- 
priate weighting coefficient can be selected according to the system design require- 
ments or experience. 


3 System Simulation 


The establishment of simulation on the field monitoring system of greenhouse envi- 
ronment is based on the environmental dynamic model, and is the validation on the 
control effects and feasibility of control rules of the designed fuzzy controller. 


3.1 Dynamic Greenhouse Model 


The greenhouse system is generally divided into five components: soil layer, crop 
layer, heating layer, indoor air layer and greenhouse covering layer. By learning from 
the theoretical results of modeling environment and climate at home and abroad, and 
considering the control function of actuating mechanism, a specific temperature dy- 
namic model [6][7] of greenhouse environmental climate is obtained shown as 
follows: 


vpCp = =Q, + Orie + Ovens ae Q.. an Q; + Qeoi1 +Qiae> Ori aa Qtran _ Qp (3) 


Where, v is the volume of greenhouse (m*), p is the air density (kg/m*), Cpis the heat 
content in the air (Jkg'K"'), and Q is energy. 

This paper conducts studies and system simulation on the above model based on 
the greenhouse environment under winter weather conditions. When the outdoor tem- 
perature in winter is low, the skylight is basically closed, without considering venting 
heat exchange, while ignoring the minor impacts of blade surface heat transfer, photo- 
synthesis, and transpiration on greenhouse[8]. The equation (3) can be simplified as: 


vpCp aa =Q, a Oneater 1 QO. oF QO, (4) 
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Where: Q, - solar radiation energy, Oneater - heating energy, Q,- energy for external 
thermal conduction, Q; - long-wave radiation energy. 
Table | shows the parameter table of greenhouse environment in the model. 


Table 1. Parameter Table 


Parameter Symbol Value 
Greenhouse volume v 1048.32 m° 
Surface area of greenhouse As 305.68 m? 
covering materials 
Light transmittance of glass T 0.89 
Heat transfer coefficient of K, 297A T° ?Wm- 
covering materials 2K! 
Air emissivity | 0.90 
Glass emissivity &2 0.90 
Air density p 1.2 kgm? 
Heat content in the air Cp 1006 Jkg'K"! 
Stefan-Boltzman Constant o 5.67x10°wm7k* 


3.2 System Simulation 


The system conducted simulation experiment on the built model by using software 
Matlab/Simulink to verify the control effect of fuzzy control strategy and the feasibili- 
ty of control rules. According to the dynamic model of the greenhouse environment, 
the block diagram of establishing Simulink simulation of the indoor temperature 
control system was shown in Figure 1. The system input was composed by indoor 
temperature deviation F and temperature variation rate EC; the system setting temper- 
ature was added into simulation for the selection of fuzzy control rules; E and EC 
output control quantity to regulate greenhouse heating system through fuzzy control, 
and output the control quantity to the workspace of Matlab. 
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Fig. 1. Simulation Block Diagram of Winter Simulink 


Since the outdoor temperature is low in winter, the wet curtain fan is generally not 
opened for cooling. According to growth requirements of crops, the setting value of 
control system temperature is 22°C during the day and 10°C during the night. The 
simulation experiment selected actual climate data on December 23, 2007, and intro- 
duced them into the greenhouse model in the form of Excel sheet as the basis for the 
computer simulation of greenhouse environment. Figure 2 shows outdoor climate 
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conditions of a day, including outdoor temperature, humidity and light intensity. The 
equation of the model is integrated by Runge - Kutta algorithm with fixed time and 
step. Figure 3 is the comparison chart of computer simulation temperature and actual 
temperature. The figure shows that in the temperature simulation curve, the lowest 
controlled indoor is about 10°C, and the maximum temperature is less than 26°C. 
Meanwhile, the optimum temperature for the growth of general crops is between 20°C 
~ 30°C, so the above fuzzy control method used for indoor temperature control can 
meet the needs of crop growth. From the comparison of the simulation curve and the 
actual curve, the actual curve had a good agreement with the simulation curve, which 


verified the accuracy of the dynamic greenhouse model and the feasibility of fuzzy 
control strategy. 
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Fig. 2. Outdoor Climate Condition 
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Fig. 3. Comparison Chart of Fuzzy Control 
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Finally, the control effects of both fuzzy control and PID control are compared. 
The mathematical model of temperature parameters of greenhouse environmental 
control can be approximated as first order pure delay plus disturbance model. The 

2s 


transfer function of first-order pure delay system is assumed as G(s) = al by 
S 
using unit step input, and the software Matlab/Simulink is used for simulation. 
Figure 4 shows the comparison chart of control effects of the two controls. It can be 
seen from the figure that from the comparison of simulation curve obtained by using 
fuzzy control method and that obtained by using PID control, the former had smaller 
overshoot, shorter adjustment time, and better control effects than those of conven- 
tional PID control, verifying the rationality of fuzzy control being used as the system 


control strategy. 
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Fig. 4. Comparison of PID Control and Fuzzy Control 


4 Conclusion 


This paper studied the control strategy of monitoring system for greenhouse environ- 
ment, and focused on introduction of the design of temperature fuzzy controller, in- 
cluding the structure of fuzzy controller, selection of input and output membership 
function, and the establishment of fuzzy control rules. The fuzzy control algorithm 
was simulated, and the feasibility of the control method and the accuracy of the model 
were validated. In the future, with the continuous development of fuzzy control tech- 
nology, the improved fuzzy control technology can be used for the further optimized 
research of control strategy of greenhouse control system to achieve better control 
purposes. 
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